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Use of genes for increasing the oil content in plants 
Description 

5 The invention relates to the use of genes that when expressed will increase the total 
amount of oil (i.e. triacylglycerols, monoacylglycerols, fatty acids) that is produced in 
transgenic organisms. 

More specifically this invention describes the identification of genes encoding oil 
10 synthesis enhancing proteins (OEP). 

In a first embodiment this Invention is directed to the protein comprising an amino acid 
sequence as set forth in SEQ ID NO: 2 or a functional fragment, derivative, variant, 
or ortologue thereof. 

15 

The present invention further includes the nucleotide sequence as set forth in 
SEQ ID NO: 1 , as well as portions of the genomic sequence, the cDNA sequence, 
allelic variants, synthetic variants and mutants thereof. This includes sequences that 
are to be used as probes, vectors for transformation or cloning intermediates. 

20 

SEQ ID NO : 2 is the deduced amino acid sequence from the open reading frame 
YPR140W in SEQ ID NO : 1. 

In a second embodiment, this Invention is directed to the protein comprising an amino 
25 acid sequence as set forth in SEQ ID NO: 4 or a functional fragment, derivative, 
variant, or ortologue thereof. 

SEQ ID NO : 4 is the deduced amino acid sequence from the open reading frame 
AM40.1 in SEQ ID NO : 3. 

30 

In a third embodiment, this invention is directed to the protein comprising an amino acid 
sequence as set forth in SEQ ID NO: 6 or a functional fragment, derivative, variant, 
or ortologue thereof. 

35 SEQ ID NO : 6 is the deduced amino acid sequence from the open reading frame 
At140.2 in SEQ ID NO : 5. 

Another aspect of the present invention relates to those polypeptides, which have at 
least 60% identity to SEQ ID NO; 2, SEQ ID NO : 4 or SEQ ID NO : 6. 

40 
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The invention furthermore relates to expression constructs for expressing YPR140w, 
At140.1 or AI140.2 in plants, preferably in plant seeds, transgenic plants expressing 
YPR140w. AM40.1 or At140.2 and to the use of said transgenic plants for the produc- 
tion of food, feed, seed, pharmaceuticals or fine chemicals, in particular for 
5 the production of oils. 

In oil crops like rape, sunflower, oil palm etc., the oil (i.e. triacytglycerols) is the most 
valuable product of the seeds or fruits and other compounds such as starch, protein 
and fiber is regarded as by-products with less value. Enhancing the quantity of oil per 

10 weight basis at the expense of other compounds in oil crops would therefore increase 
the value of the crop. If proteins that promote the allocation of reduced carbon Into the 
production of oil can be up regulated by overexpression, the cells will accumulate more 
oil at the expense of other products. This approach could not only be used to increase 
the oil content in already high oil producing organisms such as oil crops, they could 

15 also lead to significant oil production in moderate or low oil containing crops such as 
soy, oat, maize, potato, sugar beats, and turnips as well as in microorganisms. 

Increasing the oil content in plants and, in particular, in plant seeds is of great interest 
for traditional and modem plant breeding and in particular for plant biotechnology. 
20 Owing to the increasing consumption of vegetable oils for nutrition or industrial appli- 
cations, possibilities of increasing or modifying vegetable oils are increasingly the 
subject of current research (for example Topfer et al. (1995) Science 268:681-686). Its 
aim is in particular increasing the fatty acid content in seed oils. 

25 The fatty acids which can be obtained from the vegetable oils are also of particular 
interest They are employed, for example, as bases for plasticizers, lubricants, 
surfactants, cosmetics and the like and are employed as valuable bases in the food 
and feed industries. Thus, for example, it is of particular interest to provide rapeseed 
oils with fatty acids with medium chain length since these are in demand in particular in 

30 the production of surfactants. With regard to medical ramifications, the long chain fatty 
acids (C18 and longer) found in many seed oils have been linked to reductions in 
hypercholesterolemia and other clinical disorders related to coronary heart disease 
(Brenner 1976, Adv. Exp. Med. Biol. 83:85-101). Therefore, consumption of a plant 
* having increased levels of these types of fatty acids may reduce the risk of heart 

35 disease. Enhanced levels of seed oil content also increase large-scale production of 
seed oils and thereby reduce the cost of these oils. 

The targeted modulation of plant metabolic pathways by recombinant methods allows 
the modification of the plant metabolism in an advantageous manner which, when 
40 using traditional breeding methods, could only be achieved after a complicated , 
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procedure or not at all. Thus, unusual fatty adds, for example specific polyunsaturated 
fatty acids, are only synthesized in certain plants or not at all in plants and can 
therefore only be produced by expressing the relevant gene in transgenic plants (for 
example Millar et al. (2000) Trends Plant Scl 5:95-101). 

5 

Triacylgylcerides, diacylglycerides. monoacylglycerides and other lipids are synthe- 
sized from fatty acids. Fatty acid biosynthesis and triacylglyceride biosynthesis can be 
considered as separate biosynthetic pathways owing to the compartmentalization, but 
as a single biosynthetic pathway in view of the end product Lipid synthesis can be 

10 divided into two part-mechanisms, one which might be termed "prokaryotc" 

and another which may be termed "eukaryotic" (Browse et al. (1986) Biochemical J 
235:25-31; Ohlrogge & Browse (1995) Plant Cell 7:957-970). The prokaryotic mecha- 
nism is localized in the plastids and encompasses the biosynthesis of the free fatty 
acids which are exported into the cytosol, where they enter the eukaryotic mechanism 

15 in the form of fatty acid acyt-CoA esters. In this pathway the fatty acids are esterified by 
glycerol-3-phosphate acyltransferase and lysophosphaBdic acid acyltransferase to the 
sn-1 and sn-2 positions of glycerol-3-phosphate, respectively, to yield phosphatide acid 
(PA). The PA is the precursor for other polar and neutral lipids, the latter being formed 
in the Kennedy pathway (Voelker 1996, Genetic Engineering ed.:Setlow 18:111-113; 

20 Shanklin & Cahoon 1998, Annu. Rev. Plant Physiol. Plant Mol. Biol. 49:61 1-641 ; 
Frentzen 1998, Lipids 100:161-166; Millar et al. 2000, Trends Plant Sci. 5:95-101). 

Patent WO 00/18889 describes nucleic acid sequences encoding for acyltransferase 
related proteins , wherein said acyltransferase-like protein is active in the transfer of a 
25 fatty acyl group from a fatty acyl donor to fatty acyl acceptor and the use of such 

sequences to provide transgenic plants capable of producing modified lipid content and 
composition based on sequence predictions only. Biochemical data supporting these 
claims is lacking so far. 

30 The Initial step of lipid production, the transfer of a fatty add from an acyl-CoA to 
glycerol-3-phosphate was shown to be in part but not exclusively performed by two 
yeast genes, Gat1 and Gat2 (Zeng andf Zou 2001, J Biol Chem. 276(45):41710-6). 
Patent WO 02/08391 relates to DNA sequences encoding these two novel glycerol-3- 
phosphate and dihydroxy acetone phosphate acyltransferase enzymes and methods 

35 for utilizing said polynudeotides for alteration of lipid content in higher cells, in particu- 
lar plant cells. However, no counterparts from plants could be found for Gat1 or Gat2. 

The last step in the synthesis of triacylglycerols has been shown to occur by two 
different enzymatic reactions, an acyl-CoA dependent reaction catalyzed by an acyl- 
40 CoA : diacylglycerol acyltransferase (Cases. S. et al., (1998) Proc. Natl. Acad. Sd., 
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USA 95, 13018-13023.; Lardizabal, et al. p 2001) and the acyl-CoA independent 
reaction catalyzed by an phospholipid : diacylglyerol acyltransferase (Dahlqvist, et a!.. 
2000). Two unrelated gene families encoding acyl-CoA : diacylglycerol acyltrans- 
ferases have been identified in plants, animals and yeast, whereas the gene family 
encoding the acyl-CoA independent enzyme has been identified in yeast but not in 
plants or animals. In yeast, a total of four genes (arel, are2, Iro1, dgal) belong to 
these three gene families, and they are the only genes known to contribute directly 
to triacylglycerol synthesis. Thus, no synthesis of. triacylglycerol could be detected 
in yeast cells where all four genes were disrupted. 

It is an object of the present invention to provide additional methods for increasing 
the oil content in plants. 

We have found that this object is achieved by the present invention. 

In the present invention we show, that another gene is present in yeast and more- 
over in plants, which enhances the amount of triacylglycerol and fatty acids that 
accumulates in wild type yeast. 

A first subject matter of the invention comprises a method of increasing the total oil 
content in a plant organism or a tissue, organ, part, cell or propagation material thereof, 
comprising 

a) the transgenic expression of SEQ ID NO: 1, SEQ ID NO : 3 or SEQ ID NO : S in 
said plant organism or in a tissue, organ, part, cell or propagation material 
thereof, and 

b) the selection of plant organisms In which - in contrast to or comparison with 
the starting organism - the total oil content in said plant organism or in a tissue, 
organ, part, cell or propagation material thereof is increased. 

Other proteins resulting in the same effect as the proteins set forth in 
SEQ ID NO • 2, SEQ ID NO : 4 or SEQ ID NO : 6 (OEPs) are obtainable from 
the specific sequences provided herein. Furthermore, it will be apparent that one 
can obtain natural and synthetic OEPs, including those with modified amino acid 
sequences and starting materials for synthetic-protein modeling from the exempli- 
fied OEPs and from OEPs which are obtained through the use of such exempted 
sequences. Modified amino acid sequences include sequences that have been 
mutated, truncated, increased and the like, whether such sequences were partially 
3 or wholly synthesized. 
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Further, the nucleic acid probes (DNA or RNA) derived from the SEQ ID NO : 1, 
SEQ ID NO : 3 or SEQ ID NO : 5 of the. present Invention can be used to screen and 
recover homologous or related sequences from a variety of plant and microbial 
sources. 

The over-expression of SEQ ID NO : 3 or SEQ ID NO : 5 in yeast or plants increases 
the content in total lipids and enhances the amount of triacylglycerol and fatty acids 
that accumulates compared to wildtype yeasts or plants. The amount of polar lipids 
remained unchanged but a reduced amount of palmitic acid was observed. On the 
other hand the amount of oleic acids was increased. 



The present invention can be characterized by the following aspects: 



Example 1 

shows the reduction of GPAT activity in yeast cells lacking the YPR140w gene. 



Example 2 

shows the increased incorporation of glycerol-3-phosphate into lipids in crude extracts 
of yeast over expressing the YPR140w gene. 



Example 3 

shows the increased accumulation of triacylglycerol in yeast cells expressing the 
YPR140w gene from a strong promoter. 



Example 4 

describes the identification of genes homologous to YPRUOw in plants. 



Example 5 

shows the increased accumulation of triacylglycerol in yeast cells expressing the 
At140.1 or AH40.2 gene from a strong promoter. 

s^Ta significantly higher total oil content in the seeds of transgenic plant lines with 
increased expression of the AM40.1 or At140.2 gene. 
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The invention can furthermore be characterized by: 

use of the nucleic acid sequences SEQ ID NO : 1, SEQ ID NO : 3 or SEQ ID NO ; 5 
encoding a protein SEQ ID NO : 2, SEQ ID NO : 4 or SEQ ID NO : 6 that enhances the 
5 production of triacylglycerol (TAG); 

genetic transformation of an oil-producing organism with one of said sequence in order 
to be expressed in this organism, resulting in an active protein that increases the oil 
content of the organism. 

10 

The nucleic acid sequence is derived from the sequence shown in SEQ ID NO : 1 
from the Saccharomyces cerevisiae YPR140w gene (genomic clone or cDNA), 
SEQ ID NO: 3 or SEQ ID NO: 5 from Arabidopsis thaliana cr from a nucleic acid 
sequence or cDNA that contains a nucleotide sequence coding for a protein with an 
15 amino acid sequence that is 60% or more identical to the amino acid sequences as 
presented in SEQ ID NO: 2 f SEQ ID NO: 4 or SEQ ID NO: 6. 

The gene products, which we refer to as an oil synthesis enhancing protein (OEP) is 
most likely not itself catalyzing the synthesis of TAG, but its presence elevates the 
20 amount of TAG synthesized. 

Furthermore the OEP as characterised by the protein sequence SEQ ID NO : 2, 
SEQ ID NO : 4 or SEQ ID NO : 6 results in an increased incorporation of ( 14 C) 
glycerof-3-phosphate into lipids as described in example 3 and 5. 

25 

The instant invention pertains to a gene construct comprising a said nucleotide 
sequence SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 5 of the instant invention, 
which is operably linked to a heterologous nucleic acid. 

30 The term operably linked means a serial organization e.g. of a promoter, coding 
sequence, terminator and/or further regulatory elements whereby each element can 
fulfill its original function during expression of the nucleotide sequence. 

Further, a vector comprising the said nucleotide sequence SEQ ID NO: 1, SEQ ID 
35 NO: 3 or SEQ ID NO: 5 of the instant invention is contemplated In the instant invention. 
This includes also an expression vector which can harbor a selectable marker gene 
and/or nucleotide sequences for the replication in a host cell and/or the integration into 
the genome of the host cell. 

40 Furthermore, this invention relates to a method for producing an OEP encoded by one 
of the nuleic acid sequences SEQ ID NO: 1. SEQ ID NO: 3 or SEQ ID NO: 5 in a host 
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cell or progeny thereof including genetically engineered oil seeds, yeast and moulds 
or any other oil-accumulating organism, via the expression of a construct in the cell. 
Of particular interest is the expression of the nucleotide sequences of the present 
invention from transcription initiation regions that are preferentially expressed in plant 
seed tissues, it is further contemplated that an artificial gene sequence encoding an 
OEP may be synthesized, especially to provide plant-preferred codons. Transgenic 
cells containing an OEP as a result of the expression of an OEP encoding sequence 
are also contemplated within the scope of the invention. 

Further, tine invention pertains a transgenic cell or organism containing a said nucleo- 
tide sequence and/or a said gene construct and/or a said vector. The object of the 
instant invention is further a transgenic cell or organism which is an eucaryotic cell or 
organism. Preferably, the transgenic cell or organism is a yeast cell or a plant cell or 
a plant. The instant invention further pertains said transgenic cell or organism having 
an increased biosynthetic pathway for the production of substrates for tine synthesis 
oftriacytglycerol. A transgenic cell or organism having increased oil content is also 
contemplated within the scope of this invention: 

Further, the invention pertains a transgenic cell or organism wherein the activity of an 
OEP encoded by one of the nuleic acid sequences SEQ ID NO: 1. SEQ ID NO: 3 or 
SEQ ID NO: 5 is increased in said cell or organism. The increased activity of OEP is 
characterized by an alteration in gene expression, catalytic activity and/or regulation 
of activity of the enzyme. Moreover, a transgenic cell or organism is included in the 
instant invention, wherein the increased biosynthetic pathway for the production of 
substrates for the production of triacylglycerol is characterized e.g. by the prevention 
of accumulation of undesirable fatty acids in the membrane lipids. 

In a different embodiment, this invention also relates to methods of using a DNA 
sequence SEQ ID NO: 1. SEQ ID NO: 3 or SEQ ID NO: 5 for increasing the oil-content 
within the cells of different organisms. 

Further, the invention makes possible a process for elevating the production of 
triacylglycerol, which comprises growing transgenic ceils or organisms under conditions 
whereby the nucleotide sequences SEQ ID NO: 1 , SEQ ID NO : 3 or SEQ ID NO : 5 
are expressed in order to produce a protein in these cells with the ability of enhancing 
the production of triacylglycerol. 

Corresponding genes coding for OEP can be isolated from other organisms, especially 
yeast-type organisms, like e.g. Schizosaccharomyces pombe. Yarrowia lipolytica. 
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Zygosaccharomyces rouxii, Saccharomyces cerevisiae, Emericella nidulans and 
Debaryomyces hansenih 

Furthermore the invention pertains transgenic organisms comprising, In their genome 
5 or on a plasmid, a nucleic acid sequence SEQ ID NO :1, SEQ ID NO : 3 or SEQ ID NO 
: 5 according to the above, transferred by recombinant DNA technology^ One important 
type of transgenic organism covered by this invention are commercially relevant plants 
in which said nucleotide sequence preferably would be expressed under the control of 
a storage organ specific promoter. Alternatively, the nucleotide sequence could also be 
1 0 expressed under the control of a seed-specific promoter or any other promoter suitable 
for tissue-specific high-level expression in plants. 

The invention also pertains a protein encoded by a DNA molecule according to 
SEQ ID NO : 1, SEQ ID NO: 3 or SEQ ID NO: 5 or a functional biologically active 
15 fragment thereof having OEP activity in transgenic organisms. Alternatively, the 
invention pertains a protein produced in an organism, which has the amino acid 
sequence set forth in SEQ ID NO : 2 f SEQ ID NO: 4 or SEQ ID NO: 6 or an amino 
acid sequence with at least 60% homology to said amino acid sequence having OEP 
activity. 

20 

The protein can be is isolated from Saccharomyces cerevisiae or Arabidopsis thaliana 
but also from other organisms. 

The invention additionally pertains the use of a protein according to SEQ ID NO: 2, 
25 SEQ ID NO: 4 or SEQ ID NO: 6 or derivatives of that protein having OEP activity for 
the increased production of triacyiglycerols, monoacylglycerol and/or fatty acids. 

Surprisingly, it has been found that the homologous expression of the corresponding 
genes of Arabidopsis thaliana SEQ ID NO: 3 or SEQ ID NO: 5 in Arabidopsis leads to a 
30 significantly increased triacyiglyceride (storage oils) content in the seeds as described 
in example 6. The oil content was increased by approximately 5%, in one transgenic 
line even by 10%, compared with wild-type control plants. The over-expression of the 
At140.1 or At140.2 genes had no adverse effects on the growth or other properties of 
the transformed plants. 

35 

Figure 1 shows the effect of deleting the YPR140w gene in yeast on in vitro lipid 
biosynthesis. The incorporation rate of [ 14 C]glycerol-3phosphate <[ 14 C]G3P) into lipids 
was analyzed In microsomal fractions of YPR140w-deletion and wt yeast (Saccharo- 
myces cerevisiae) strains. Yeast strains were grown in rich media to an optical density 
40 of 1 .5 (Asoo) and microsomal fractions were prepared. Enzymatic assays (n=4) were 
performed for 10 minutes at 30'C as described in Example 1. 
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The effect of over expressing gene YPR140W in yeast on the in vitro incorporation rate 
of [ 14 C]G3P into lipids is demonstrated in figure 2. The ana-lyses were performed with 
crude extracts of Saccharomyces cerevisiae transformed with pAN3, multicopy plasmid 
with the gene YPR140w behind the GAL1 promoter, or pYES2, empty plasmid control 
as described in Example 2. Data presented are average from 8 replicates. 

In figure 3 the total lipid content in the YPR140w yeast transformant harvested 
24 hours (Exp.#1; open bar) or 28 hours (Exp.#2; filled bar) after induction is shown- 
Yeast cells were transformed with the pAN3 vector expressing the YPRT40w from the 
GAL1 inducible promoter. Yeast celts transformed with empty vector (pYES2) were 
used as control. Total lipid content was determined as nmol fatty add (FA) per mg dry 
cell weight 

In figure 4 the total lipid content in the different yeast transformants harvested after 
24 hours is shown. Yeast cells were transformed with the pY-At140.1 or the pY-At140.2 
vector expressing the AM40.1 or AH40.2 from the GAL1 Inducible promoter. Yeast 
cells transformed with empty vector (pYES2) were used as control. Total lipid content 
was determined as nmol fatty acid (FA) per mg dry cell weight 

In figure 5 the total lipid content of dry T2 generation seeds from Arabldopsis plants 
overexpressing AH40.1 and At140>2 is shown. Wildtype Arabidopsis. plants were used 
as control. Total lipid content was determined by conventional gas chromatography as 
mg FA per mg dry seeds. 

The method according to the invention can be applied in principle to all plant species, 
in addition to the species Arabidopsis thaliana, which Is employed as model plant. The 
method according to the invention is preferably applied to oil crops whose oil content is 
already naturally high and/or for the industrial production of oils. 

Plant organism or tissue, organ, part, cell or propagation material thereof is generally 
understood as meaning any single- or multi-celled organism or a cell, tissue, part or 
propagation material (such as seeds or fruit) of same which is capable of photo- 
synthesis. Included for the purpose of the invention are all genera and species of 
higher and lower plants of the Plant Kingdom. Annual, perennial, monocotyledonous 
and dicotyledonous plants are preferred. Also included are mature plants, seeds, 
shoots and seedlings, and parts, propagation material (for example tubers, seeds or 
fruits) and cultures derived from them, for example cell cultures or callus cultures. 
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Plant encompasses all annual and perennial monocotyledonous or dicotyledonous 
plants and includes by way of example, but not by limitation, those of the genera 
Cucurbita, Rosa, Vitis, Juglans, Fragaria, Lotus, Medicago, Onobrychls, Trifolium, 
Trigonella, Vigna, Citrus, Linum. Geranium, Manihot, Daucus, Arabidopsis, Brassica, 
5 Raphanus, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, Nicotiana, 
Solarium, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum. 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Pisum, Phaseolus, 
Lolium, Oryza, Zea, Avena, Hordeum, Secale, Triticum, Sorghum, Plcea and Populus. 

10 

Preferred plants are those from the following plant families: Amaranthaceae, As- 
teraceae, Brassicaceae, Carophyllaceae, Chenopodlaceae, Compositae, Cruciferae, 
Cucurbitaceae, Labiatae, Leguminosae, Papilionoideae, Uliaceae, Linaceae, Malva- 
ceae, Rosaceae, Rubiaceae, Saxifragaceae, Scrophulariaceae, Solanaceae, Sterculi- 
15 aceae, Tetragoniaceae, Theaceae, Umbelliferae, 

Preferred monocotyledonous plants are selected in particular from the monocotyledon- 
ous crop plants such as, for example, the Gramineae family, such as rice, maize, 
wheat or other cereal species such as barley, millet and sorghum, rye, triticale or oats, 
20 and sugar cane, and all grass species. 

The invention is applied very particularly preferably to dicotyledonous plant organisms. 
Preferred dicotyledonous plants ere selected in particular from the dicotyledonous 
crop plants such as, for example, 

25 

Asteraceae such as Heliantus annuus (sunflower), tagetes or calendula and 
others, 

Compositae, especially the genus Lactuca, very particularly the species sativa 
30 (lettuce) and others, 

Cruciferae, particularly the genus Brassica, very particularly the species napus 
(oilseed rape), campestris (beet), oleracea cvTastie (cabbage), oleracea cv 
Snowball Y (cauliflower) and oleracea cv Emperor (broccoli) and other cabbages; 
35 and the genus Arabidopsis, very particularly the species thaliana, and cress 

or canola and others, 

Cucurbitaceae such as melon, pumpkin/squash or zucchini and others, 
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Leguminosae, particularly the genus Glycine, very particularly the species max 
(soybean), soya, and alfalfa, pea, beans or peanut and others, 

Rublaceae, preferably the subclass Lamiidae such as, for example Coffea 
arabica or Coffea liberica (coffee bush) and others, 

Solanaceae, particularly the genus Lycopersicon, very particularly the species 
esculentum (tomato), the genus Solanum, very particularly the species tubero- 
sum (potato) and melongena (aubergine) and the genus Capsicum, very particu- 
larly the genus annuum (pepper) and tobacco or paprika and others, 

Sterculiaceae, preferably the subclass Dilieniidae such as, for example, Theo- 
broma cacao (cacao bush) and others, 

Theaceae, preferably the subclass Dilieniidae such as, for example, Camellia 
sinensis or Thea sinensis (tea shrub) and others, . 



Umbelliferae, particularly the genus Daucus (very particularly the species carota 
(carrot)) and Apium (very particularly the spedes graveolens dulce (celeary)) and 
others; 

and linseed, cotton, hemp, flax, cucumber, spinach, carrot, sugar beet and the various 
tree, nut and grapevine species, in particular banana and kiwi fruit 

Also encompassed are ornamental plants, useful or ornamental trees, flowers, cut 
flowers, shrubs or turf plants which may be mentioned by way of example but not by 
limitation are angiosperms. bryophytes such as, for example, Hepaticae (liverworts) 
and Musci (mosses); pteridophytes such as ferns, horsetail and clubmosses; gymno- 
sperms such as conifers, cycades, ginkgo and Gnetatae; algae such as Chlorophy- 
ceae, Phaeophpyceae, Rhodophyceae, Myxophyceae, Xanthophyceae, Bacillariophy- 
ceae (diatoms) and Euglenophyceae. Plants within the scope of the invention comprise 
by way of example and not by way of limitation, the families of the Rosaceae such as 
rose, Ericaceae such as rhododendron and azalea, Euphorbiaceae such as poinsettias 
and croton, Caryophyllaceae such as pinks. Solanaceae such as petunias, Gesneri- 
aceae such as African violet, Balsaminaceae such as touch-me-not, Orchidaceae such 
as orchids, Iridaceae such as gladioli, iris, freesia and crocus, Compositae such as 
marigold. Geraniaceae such as geranium. Uliaceae such as dracena, Moraceae such 
as ficus, Araceae such as cheeseplant and many others. 
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Furthermore, plant organisms for the purposes of the invention are further organisms 
capable of being photosynthetically active such as, for example, algae, cyanobacteria 
and mosses. Preferred algae are green algae such as, for example, algae from 
the genus Haematococcus. Phaedactylum tricornatum, Volvox or Dunaliella. Syne- 
5 chocystis is particularly preferred. 

Most preferred are oil crops. Oil crops are understood as being plants whose oil 
content Is already naturally high and/or which can be used for the industrial production 
of oils. These plants can have a high oil content and/or else a particular fatty acid 

10 composition which is of interest industrially. Preferred plants are those with a lipid 

content of at least 1 % by weight Oil crops encompassed by way of example: Bon/ago 
officinalis (borage); Brassica species such as B. campestris, B, napus, B. rapa 
(mustard, oilseed rape or turnip rape); Cannabis sativa (hemp); Carthamus tinctorius 
(safflower); Cocos nucifera (coconut); Crambe abyssinica (crambe); Cuphea species 

1 5 (Cuphea species yield fatty acids of medium chain length, in particular for industrial 
applications); Elaeis guinensis (African oil palm); Elaeis oleifera (American oil palm); 
Glycine max (soybean); Gossypium hirisutfum (American cotton); Gossypium bar- 
badense (Egyptian cotton); Gossypium herbaceum (Asian cotton); Helianthus annuus 
(sunflower); Linum usitatissimum (linseed or flax); Oenothera biennis (evening 

20 primrose); O/ea europaea (olive); Oryza sativa (rice); Ricinus communis (castor); 
Sesamum indicum (sesame); Triticum species (wheat); Zea mays (maize), and 
various nut species such as, for example, walnut or almond. 

Total oil content" refers to the sum of all oils, preferably to the sum of the triacylgly- 
25 cerides. 
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-Oils- encompasses neutra! and/or polar lipids and mixtures of these. Those mentioned 
in Table 1 may be mentioned by way of example, but not by limitation. 



Neutrale lipids 


Tnacylglycerol (TAG) 




Diacylglycerol (DAG) 




Monoacylglycerol (MAG) 






Polar lipids 


Monogalactosyidiacyigiycerol (MGOG) 




uigaiactosyldiacylglycerol(DGDG) 




Phosphatidylglycerol (PG) 




Phosphatidylcholine (PC) . 




Phosphatidylethanolamine (PE) 




Phosphatidylinositol (PI) 




Phosphatidylserine (PS) 




sujtoquinovosyldiacylglycerol 



Table 1 : Classes of plant lipids 

Neutral lipids preferably refers to triacylglycerides. Both neutral and polar lipids may 
comprise a wide range of various fatty acids. The fatty acids mentioned in Table 2 
may be mentioned by way of example, but not by limitation. 



Nomenclature 1 


Name 


16:0 


Palmitic acid' 


16:1 


Palmitoleic acid 


16:3 


Roughanic acid 


18:0 


Stearic acid 


18:1 


Oleic acid 


18:2 


Linoleic acid 


18:3 


Linolenic acid 


Y-18:3-18:3 


Gamma-linolenic acid * 


20:0 


Aracnidic acid 


22:6 


Docosahexaenoic acid (DHA) * 


20:2 


Eicosadienolc acid 


20:4 


Arachidonic acid (AA) * 


20:5 n 


Eicosapentaenoic acid (EPA) * 


22:1 


Erucic acid 



Table 2: Overview over various fatty acids (selection) 
1 0 1 Chain length: number of double bonds 

* not naturally occurring in plants 
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Oils preferably relates to seed oils. 

"Increase in* the total oil content refers to the increased oil content in a plant or a part 
5 tissue or organ thereof, preferably in the seed organs of the plants. In this context, the 
oil content is at least 5%, preferably at least 10%, particularly preferably at least 15%. 
very particularly preferably at least 20%, most preferably at least 25% increased under 
otherwise identical conditions in comparison with a starting plant which has not been 
subjected to the method according to the invention, but is otherwise unmodified. 
10 Conditions in this context means all of the conditions which are relevant for germina- 
tion, culture or growth of the plant, such as soil conditions, climatic conditions, light 
conditions, fertilization, Irrigation, plant protection treatment and the like. 

"YPR140w° generally refers to all those proteins which are capable of increasing the oil 
15 content in oil producing organims, especially microorganisms, yeast, fungi and plants 
and are identical to SEQ ID NO: 2 or have homology to SEQ ID NO: 2. 

Yeast refers to the group of unicellular fungi with a pronounced cell wall and formation 
of pseudomyceiium (in contrast to molds). They reproduce vegetatively by budding 

20 and/or fission (Schizosaccharomyces and Saccharomycodes, respectively). 

Encompassed are what are known as false yeasts, preferably the families Crypto- 
coccaceae, Sporobolomycetaceae with the genera Cryptococcus, Torulopsis, Pityro- 
sporum, Brettanomyces, Candida, Kloeckera, Trigonopsis, Trichosporon, Rhodotorula 
and Sporobolomyces and Bullera, and true yeasts (yeasts which also reproduce 

25 sexually; ascus), preferably the families Endo- and Saccharomycetaceae, with the 

genera Saccharomyces, Debaromyces, Lipomyces, Hansenula, Endomycopsis, Pichia, 
Hanseniaspora. Most preferred are the genera Saccharomyces cerevisiae, Pichia 
pastoris, Hansenula polyrnorpha, Schizosaccharomyces pombe, Kluyveromyces lactls, 
Zygosaccharomyces rouxii, und Yarrowia lipolltica, Emericella nidulans, Aspergillus 

30 nidulans. Debaryomyces hansenii and Torulaspora hansenii. 

YPR140W refers in particular to the polypeptide sequence SEQ ID NO : 2. 

Most preferably, YPR140W refers to the yeast protein OEP as shown in SEQ ID NO : 2 
35 and functional equivalents or else functionally equivalent portions of the above. 

tt At140.r and °At140.2 n refer in particular to the polypeptide sequence SEQ ID NO : 4 
and SEQ ID NO : 6. 
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Most preferably, At140.1 and AH40.2 refer to the Arabidopsis thaliana protein OEP 
as shown in SEQ ID NO : 4 and SEQ ID NO : 6 and functional equivalents or else 
functionally equivalent portions of Vie above. 

5 Functional equivalents refers in particular to natural or artificial mutations of the yeast 
protein OEP as shown in SEQ ID NO : 2 and. homologous polypeptides from other 
yeasts which have the same essential characteristics of YPR140w as defined above. 
Mutations encompass substitutions, additions, deletions, inversions or insertions of one 
or more amino acid residues. 

10 

The YPR140W to be employed advantageously within the scope of the present 
invention can be found readily by database searches or by screening gene or cDNA 
libraries using the polypeptide sequence shown in SEQ ID NO : 2, which is given by 
way of example, or the nucleic acid sequence as shown in SEQ ID NO : 1, which 
1 5 encodes the latter, as search sequence or probe. 

Said functional equivalents preferably have at least 60%, particularly preferably at least 
70%, particularly preferably at least 80%, most preferably at least 90% homology with 
the protein of SEQ ID NO: 2. 

20 

Functional equivalents refer in particular to natural or artificial mutations of the Arabi- 
dopsis thaliana OEP as shown in SEQ ID NO: 4 or SEQ ID NO : 6 and homologous 
polypeptides from other plants which have the same essential characteristics of an 
.OEP as defined above. Mutations encompass substitutions, additions, deletions. 
25 inversions or insertions of one or more amino acid residues. 

The nucleic acid to be employed advantageously within the scope of the present 
invention can be found readily by database searches or by screening gene or cDNA 
libraries using the polypeptide sequences shown in SEQ ID NO : 4 or SEQ ID NO : 6, 
30 which is given by way of example, or the nucleic acid sequence as shown in SEQ ID 
NO : 3 and SEQ ID NO : 5, which encode the latter, as search sequence or probe. 

Furthermore the following nucleic acid sequences can be used in order to identify 
and clone genes encoding an OEP from plant organisms and having at least 60% 
35 homology to SEQ ID NO : 3 or SEQ ID NO : 5 : 

Arabidopsis thaliana ESTs from GenBank: AV782272, AV821735, AV825509, 
AI996105. AV791433. BE528444 
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Glycine max ESTs from GenBank: AW458589. BQ740950, BI893252, BG1 54651, 
AW507932 

Rose Petals EST from GenBank: BQ1 05746 

5 

Rice EST from GenBank: AU 166240 
Tomato ESTs from GenBank: BG 123776, AW218643 
10 Medicago truncatula ESTs from GenBank: BG644974, CA858351 
Gossypium arboreum EST from GenBank: BQ403497 
Potato EST from GenBank: BE342183 

15 

Furthermore the nucleic acid sequences SEQ ID NO : 7 to SEQ ID NO : 35 can be 
used in order to identify and clone genes encoding an OEP from plant organisms 
having at least 60 % homology to SEQ ID NO : 3 or SEQ ID NO : 5 . 

20 Said functional equivalents have at least 60%, preferably at least 70%, particularly 
preferably at least 80%, most preferably at least 90% homology with the protein of 
SEQ ID NO : 4 or SEQ ID NO : 6. 

The functional equivalent of SEQ ID NO : 2, SEQ ID NO : 4 or SEQ ID NO : 6 has 
25 an identity of at least 50%, 51%, 52%, 53%. 54%, 55%, 56%, 57% preferably at least 

58%, 59%, 60%, 61%. 62%, 63%. 64%, 65%. 66%. 67%, 68%. 69%, and 70% more 

preferably 71%, 72%, 73%, 74%, 75%, 76%, 77%. 78%. 79%, 80%. 81%, 82%, 83%. 

84%, 85% most preferably at least 86%, 87%. 88%, 89%, 90%, 91%, 92%. 93%, 94%. 

95%, 96%, 97%, 98%, 99% identity with the SEQ ID NO : 2. SEQ ID NO : 4 or SEQ ID 
30 NO : 6. 

Homology between two polypeptides is understood as meaning the identity of the 
amino acid sequence over the entire sequence length which is calculated by compari- 
son with the aid of the program algorithm GAP (Wisconsin Package Version 10.0, 
35 University of Wisconsin, Genetics Computer Group (GCG), Madison, USA), setting the 
following parameters: 

Gap Weight: 8 . Length Weight: 2 



40 



Average Match: 2,912 Average Mismatch: -2,003 
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For example, a sequence with at least 80% homology with the sequence SEQ ID NO: 2 
at the protein level is understood as meaning a sequence which, upon comparison with 
the sequence SEQ ID NO: 2 with the above program algorithm and the above para- 
meter set has at least 80% homology. 

5 

Functional equivalents - for example - also encompass those proteins which are 
encoded by nucleic acid sequences which have at least 60%, particularly preferably at 
least 70%, particularly preferably at least 80%, most preferably at least 90% homology 
with the nucleic acid sequence with the SEQ ID NO: 1. 

10 

Homology between two nucleic acid sequences is understood as meaning the identity 
of the two nucleic acid sequences over the entire sequence length which is calculated 
by comparison with the aid of the program algorithm GAP (Wisconsin Package Version 
1 0.0, University of Wisconsin, Genetics Computer Group (GCG), Madison, USA), 
1 5 setting the following parameters: 

Gap Weight: 50 Length Weight: 3 

Average Match: 10 Average Mismatch:0 

20 

For example, a sequence which has at least 80% homology with the sequence 
SEQ ID NO: 1 at the nucleic acid level is understood as meaning a sequence which, 
upon comparison with the sequence SEQ ID NO: 1 within the above program algorithm 
with the above parameter set has a homology of at least 80%. 

25 

Functional equivalents also encompass those proteins which are encoded by nucleic 
acid sequences which hybridize under standard conditions with a nucleic acid se- 
quence described by SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 5 t the nucleic acid 
sequence which is complementary thereto or parts of the above and which have the 
30 essential characteristics of an OEP as characterised by SEQ ID NO : 2, SEQ ID NO: 4 
or SEQ ID NO: 6. 

Natural examples of OEPs and the corresponding genes can furthermore readily be 
found in various organisms, in particular plants, whose genomic sequence is unknown, 
35 by hybridization techniques in a manner known per se, for example starting from the 
nucleic acid sequences SEQ ID NO : 1 , SEQ ID NO : 3 or SEQ ID NO : 5. 

The hybridization may be carried out under moderate (low stringency) or. preferably, 
under stringent (high stringency) conditions. 

40 
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Such hybridization conditions are described, inter alia, in Sambrook, J., Fritsch, E.F., 
Maniatis, T., in: Molecular Cloning (A Laboratory Manual), 2nd edition. Cold Spring 
Harbor Laboratory Press, 1989, pages 9.31-9.57 or in Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. 



By way of example, the conditions during the washing step may be selected from the 
range of conditions which is limited by those with low stringency (with 2X SSC at 50°C) 
and those with high stringency (with 0.2X SSC at 50°C, preferably at 65°C) 
(20X SSC: 0.3 M sodium citrate, 3 M sodium chloride, pH 7.0). 



In addition, the temperature may be raised during the washing step from moderate 
conditions at room temperature, 22°C, to stringent conditions at 65°C. 

Both parameters, salt concentration and temperature, may be varied simultaneously 
15 and it is also possible to keep one of the two parameters constant and to vary only the 
other one. It is also possible to use denaturing agents such as, for example, fomnamide 
or SDS during hybridization. In the presence of 50% formamide, the hybridization is 
preferably carried out at 42°C. 

20 Some exemplary conditions for hybridization and washing step are listed below: 

(1.) hybridization conditions with, for example 



5 



10 



(i)4XSSCat65 6 C, or 



25 



(ii)6XSSCat45°C,or 



(iii) 



6X SSC at 68°C, 100 mg/ml denatured fish sperm DNA, or 



30 



(iv) 



6X SSC, 0.5% SDS, 100 mg/ml denatured fragmented salmon sperm 
DNAat68**C, or 



(v) 



6X SSC, 0.5% SDS, 100 mg/ml denatured fragmented salmon sperm 
DNA, 50% formamide at 42°C, or 



35 



(vi) 



50% formamide, 4X SSC at 42°C, or 



40 



(vii) 



50% (vol/vol) formamide, 0.1% bovine serum albumin. 0.1% 
Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium phosphate buffer 
pH 6.5, 750 mM NaCI, 75 mM sodium citrate at 42°C, or 
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(viii) 2X or 4X SSC at 50°C (moderate conditions), or 

(ix) 30 to 40% formamide. 2X or 4X SSC at 42°C (moderate conditions). 
5 (2.) Washing steps of 10 minutes each with, for example 



(i) 


0.015 M NaCI/0.0015 M sodium citrate/0.1% SDS at 50"C, or 


(ii) 


0.1XSSCat65'C, or 


(iii) 


0.1X SSC, 0.5% SDS at 68°C, or 


(iv) 


0.1X SSC, 0.5% SDS, 50% formamide at42°C, or 


(v) 


0.2X SSC, 0.1% SDS at42°C, or 


(vi) 


2X SSC at 65°C (moderate conditions). 



The invention furthermore relates to transgenic expression constructs which can 
20 ensure a transgenic expression of an OEP as characterised by SEQ ID NO : 2, SEQ ID 
NO: 4 or SEQ ID NO: 6 in a plant organism or a tissue, organ, part, cells or propagation 
material of said plant organism. 

The definition given above applies to an OEP, with the transgenic expression of an 
25 OEP described by the sequence with the SEQ ID NO; 4 or SEQ ID NO: 6 being 
particularly preferred. 

In said transgenic expression constructs, a nucleic acid molecule encoding an OEP is 
preferably in operable linkage with at least one genetic control element (for example a 
30 promoter) which ensures expression in a plant organism or a tissue, organ, part, cell or 
propagation material of same. 

Especially preferred are transgenic expression cassettes wherein the nucleic acid 
sequence encoding an OEP is described by 

35 

a) a sequence SEQ ID NO : 1, SEQ ID NO: 3 or SEQ ID NO: 5 



b) a sequence derived from a sequence SEQ ID NO : 1 , SEQ ID NO: 3 or SEQ ID 
NO; 5 in accordance with the degeneracy of the genetic code 

40 
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c) a sequence which has at least 60% identity with a sequence SEQ ID NO : 1 , 
SEQ !D NO: 3 or SEQ ID NO: 5. 

Operable linkage is understood as meaning, for example, the sequential arrange- 
5 ment of a promoter with the nucleic acid sequence encoding an OEP which is to be 
expre-ssed (for example the sequence as shown in SEQ ID NO: 1 , SEQ ID NO: 3 or 
SEQ ID NO: 5 and, if appropriate, further regulatory elements such as, for example, 
a terminator in such a way that each of the regulatory elements can fulfil its function 
when the nucleic acid sequence is expressed recombinant^. Direct linkage in the 

10 chemical sense is not necessarily required for this purpose. ^Genetic control sequences 
such as, for example, enhancer sequences can also exert their function on the target 
sequence from positions which are further re-moved or indeed from other DNA 
molecules. Preferred arrangements are those in which the nucleic acid sequence to 
be expressed recombinantly is positioned behind the sequence acting as promoter so 

1 S that the two sequences are linked covalently to each other. The distance between the 
promoter sequence and the nucleic acid se-quence to be expressed recombinantly is 
preferably less than 200 base pairs, particularly preferably less than 100 base pairs, 
very particularly preferably less than 50 base pairs. 

20 Operable linkage and a transgenic expression cassette can both be effected by 

meansof conventional recombination and cloning techniques as they are described, 
for ex-ample, in Maniatis T, Fritsch EF and Sambrook J (1989) Molecular Cloning: A 
Labora-tory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor (NY), in 
Silhavy TJ, Berman ML und Enquist LW (1984) Experiments with Gene Fusions, 

25 Cold Spring Harbor Laboratory, Cold Spring Harbor (NY), in Ausubel FM et ah (1987) 
Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley Inter- 
science and in Gelvin et al. (1990) In: Plant Molecular Biology Manual. However, 
further sequences which, for example, act as a linker with specific cleavage sites for 
restriction enzymes, or of a signal peptide, may also be positioned between the two 

30 sequences. Also, the insertion of sequences may lead to the expression of fusion 
proteins. Preferably, the expression cassette composed of a promoter linked to a 
nucleic acid sequence to be expressed can be in a vector-integrated form and can be 
inserted into a plant genome, for example by transformation. 

35 However, a transgenic expression cassette is also understood as meaning those con- 
structs where the nucleic acid sequence encoding an OEP is placed behind an endo- 
genous plant promoter in such a way that the latter brings about the expression of the 
OEP. 
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Promoters which are preferably introduced into the transgenic expression cassettes are 
those which are operable in a plant organism or a tissue, organ, part, cell or propaga- 
tion material of same. Promoters which are operable in plant organisms is understood 
as meaning any promoter which is capable of governing the expression of genes, in 
5 particular foreign genes, in plants or plant parts, plant cells, plant tissues or plant 
cultures. In this context, expression may be, for example, constitutive, inducible or 
development-dependent 

The following are preferred: 

10 

a) Constitutive promoters 

"Constitutive" promoters refers to those promoters which ensure expression in a 
large number of. preferably all, tissues over a substantial period of plant deveiop- 
15 ment, preferably at all times during plant development (Benfey et al.(1989) 

EMBO J 8:2195-2202). A plant promoter or promoter originating from a plant 
virus is especially preferably used. The promoter of the CaMV (cauliflower 
mosaic virus) 35S transcript (Franck et al. (1980) Cell 21:285-294; Odell et al. 
(1985) Nature 313:810-812; Shewmaker et al. (1985) Virology 140:281-288; 
20 Gardner et al. (1986) Plant MoJ Biol 6:221 - 228) or the 19S CaMV promoter 

(US 5,352,605; WO 84/02913; Benfey etal. (1989) EMBO J 8:2195-2202) are 
especially preferred. Another suitable constitutive promoter is the Rubisco small 
subunit (SSU) promoter (US 4.962,028). the leguminB promoter (GenBank Acc. 
No. X03677), the promoter of the nopalin synthase from Agrobacterium, the TR 
25 dual promoter, the OCS (octopine synthase) promoter from Agrobacterium, the 

ubiquitin promoter (Holtorf S et al. (1995) Plant Mol Biol 29:637-649), the ubi- 
quitin 1 promoter (Christensen etal. (1992) Plant Mol Biol 18:675-689; Bruce 
et al. (1989) Proc Natl Acad Sci USA 86:9692-9696). the Smas promoter, the 
cinnamyl alcohol dehydrogenase promoter (US 5.683,439), the promoters of the 
30 vacuolar ATPase subunits, the promoter of the Arabidopsis thaliana nitrilase-1 

gene (GenBank Acc. No.: U38846, nucleotides 3862 to 5325 or else 5342) or 
the promoter of a proline-rich protein from wheat (WO 91 /1 3991 ), and further 
promoters of genes whose constitutive expression in plants is known to the 
skilled worker. The CaMV 35S promoter and the Arabidopsis thaliana nitrilase-1 
35 promoter are particularly preferred. 

b) Tissue-specific promoters 

Furthermore preferred are promoters with specificities for seeds, such as, for 
40 example, the phaseolin promoter (US 5,504,200; Bustos MM et al. (1989) Plant 
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Cell 1(9):839-53). the promoter of the 2S albumin gene (Joseffson LG et ah 
(1987) J Biol Chem 262:12196- 12201), the legumine promoter (Shirsat A et al. 
(1989) Mol Gen Genet 215(2):326-331), the USP (unknown seed protein) pro- 
moter (Baumlein H et al. (1991) Mol Gen Genet 225(3):459-67), the napin gene 
promoter (US 5,608,152; Stalberg K et al. (1996) L Planta 199:515-519). the 
pro-moter of the sucrose binding proteins (WO 00/26388) or the legumin B4 pro- 
moter (LeB4; Baumlein H et al. (1991) Mol Gen Genet 225: 121-128; Baumlein 
et al. (1992) Plant Journal 2(2):233-9; Redler U et al. (1995) Biotechnology 
(NY) 13{10):1090f)r the Arabidopsis oleosin promoter (WO 98/45461), and the 
Brassica Bce4 promoter (WO 91/13980). 

Further suitable seed-specific promoters are those of the gene encoding high* 
molecular weight glutenin (HMWG), gliadin, branching enyzme, ADP glucose 
pyrophosphatase (AGPase) or starch synthase. Promoters which are furthermore 
preferred are those which permit a seed-specific expression in monocots such as 
maize, barley, wheat rye, rice and the like. The promoter of the Ipt2 or Ipt1 gene 
(WO 95/15389, WO 95/23230) or the promoters described in WO 99/16890 
(promoters of the hordein gene, the glutelin gene, the oryzin gene, the prolamin 
gene, the gliadin gene, the glutelin gene, the zeln gene, the casirin gene or the 
secalin gene) can advantageously be employed, 

c) Chemically inducible promoters 

The expression cassettes may also contain a chemically inducible promoter 
(review article: Gatz et al. (1997) Annu Rev Plant Physiol Plant Mol Biol 
48:89-108), by means of which the expression of the exogenous gene in the 
plant can be controlled at a particular point in time. Such promoters such as, for 
example, the PRP1 promoter (Ward et al. (1993) Plant Mol Biol 22:361-366), a 
salicylic acid-inducibie promoter (WO 95/19443), a benzenesulfonamide-indu- 
cible promoter (EP 0 388 186), a tetracyclin-inducible promoter (Gatz et al. 
(1992) Plant J 2:397-404), an abscisic acid-inducible promoter EP 0 335 528) or 
an ethanoi-cyciohexanone-inducible promoter (WO 93/21334) can likewise be 
used. Also suitable is the promoter of the glutathione-S transferase isoform II 
gene (GST-ll-27), which can be activated by exogenously applied safeners such 
as, for example, N.N-diallyl-^-dichioroacetamide (W0 93/01294) and which is 
operable in a large number of tissues of both monocots and dicots. 

Particularly preferred are constitutive promoters, very particularly preferred seed- 
specific promoters, in particular the napin promoter and the USP promoter. 



BASF Plant Science GmbH 20020908 LZ'L\ ' J <JV9 L8) iez89uej.d«3 

23 

In addition, further promoters which make possible expression in further plant tissues 
or in other organisms such as, for example, Ecoli bacteria, may be linked operabty with 
the nucleic acid sequence to be expressed. Suitable plant promoters are, in principle, 
all of the above-described promoters. 

The nucleic acid sequences present in the transgenic expression cassettes according 
to the invention or transgenic vectors can be linked opsrably with further genetic control 
sequences besides a promoter. The term genetic control sequences is to be under- 
stood in the broad sense and refers to all those sequences which have an effect on the 
establishment or the function of the expression cassette according to the invention. 
Genetic control sequences modify, for example, transcription and translation in pro- 
karyotic or eukaryotic organisms. The transgenic expression cassettes according to the 
invention preferably encompass a plant-specific promoter 5'-upstream of the nucleic 
acid sequence to be expressed recombinant^ in each case and, as additional genetic 
control sequence, a terminator sequence 3'-downstream, and, if appropriate, further 
customary regulatory elements, in each case linked operably with the nucleic acid 
sequence to be expressed recombinantly. 

Genetic control sequences also encompass further promoters, promoter elements or 
minimal promoters capable of modifying the expression-controlling properties. Thus, 
genetic control sequences can, for example, bring about tissue-specific expression 
which is additionally dependent on certain stress factors. Such elements are, for ex- 
ample, described for water stress, abscisioacid (Lam E and Chua NH, J Biol Chem 
1991; 266(26): 17131 -17135) and thermal stress (Schoffl F et aL (1989) Mol Gen 
Genetics 217(2-3):246-53). 

Further advantageous control sequences are, for example, in the Gram-positive 
pro-moters amy and SP02. and in the yeast or fungal promoters ADC1, MFa, AC, 
P-60, CYC1 , GAPDH, TEF, rp28, ADH. 

In principle all natural promoters with their regulatory sequences like those mentioned 
above may be used for the method according to the invention. In addition, synthetic 
promoters may also be used advantageously. 

Genetic control sequences further also encompass the 5'-untranslated regions, introns 
or nonencoding 3'-region of genes, such as, for example, the actin-1 intron. or the 
Adh1-S intron 1, 2 and 6 (for general reference, see: The Maize Handbook. Chapter 
1 16, Freeling and Walbot, Eds., Springer, New York (1994)), It has been demonstrated 
that these may play a significant role in regulating gene expression. Thus, it has been 
demonstrated that S'-untranslated sequences can enhance the transient expression of 
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heterologous genes. Translation enhancers which may be mentioned by way of 
ex-ample are the tobacco mosaic virus 5* leader sequence (Gallie et al. (1987) Nucl 
Acids Res 15:8693-871 1) and the like. They may furthermore promote tissue specificity 
(Rouster J et al. (1998) Plant J 15:435^440). 

5 

The transient expression cassette can advantageously contain one or more of what are 
known as enhancer sequences in operable linkage with the promoter, and these make 
possible an increased recombinant expression of the nucleic acid sequence. Additional 
advantageous sequences such as further regulatory elements or terminators may also 
10 be inserted at the 3' end of the nucleic acid sequences to be expressed recombinantiy 
One or more copies of the nucleic acid sequences to be expressed recombinanly may 
be present in the gene construct 

Polyadenylation signals which are suitable as control sequences are plant poly- 
15 adenylation signals, preferably those which correspond essentially to Agrobacterium 
tumefaciens T-DNA polyadenylation signals, in particular those of gene 3 of the T-DNA 
(octopine synthase) of the Ti plasmid pTiACHS (Gielen et al. (1984) EMBO J 3:835 
et seq.) or functional equivalents thereof. Examples of particularly suitable terminator 
sequences are the OCS (octopin synthase) terminator and the NOS (nopaline syn- 
20 thase) terminator. 

Control sequences are furthermore understood as those which make possible homo- 
logous recombination or insertion into the genome of a host organism, or removal from 
the genome. In the case of homologous recombination, for example, the coding 

25 sequence of the specific endogenous gene can be exchanged in a directed fashion for 
a sequence encoding a dsRNA Methods such as the cre/lox technology permit the 
tissue-specific, possibly inducible, removal of the expression cassette from the genome 
of the host organism (Sauer B (1998) Methods. 14(4):381-92). Here, certain flanking 
sequences are added to the target gene (lox sequences), and these make possible 

30 removal by means of ere recombinase at a later point in time. 

A recombinant expression cassette and the recombinant vectors derived from it may 
comprise further functional elements. The term functional element is to be understood 
in the broad sense and refers to all those elements which have an effect on generation, 
35 replication or function of the expression cassettes, vectors or transgenic organisms 
according to the invention. Examples which may be mentioned, but not by way of 
limitation, are:. 



40 



a) Selection markers which confer resistance to a metabolism inhibitor such as 
2-deoxyglucose-6-phosphate (WO 98/45456), antibiotics or biocides, preferably 



10 



15 
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herbicides, such as. for example, kanamydn. G 418, bleomycin, hygromycin or 
Phosph nothricin and the like. Particularly preferred selection markers are those 
which confer resistance to herbicides. The following maybe mentioned by way of 
example: DMA sequences which encode phosphinothricin acetyltransferases 
(PAT) and which inactivate glutamine synthase inhibitors (barand pat gene) 
S^nolpymvylshikimate^-phosphate synthase genes (EPSP synthase genes) 
which confer resistance to Glyphosate Wphosphonomethyl)giyc.ne), the gox 
gene, which encodes Glyphosate-degrading enzyme (Glyphosate oxidoreduc- 
tase). the deh gene (encoding a dehalogenase which inactivates dalapon) sul- 
fonylurea- and imidazolinone-inactivating acetolactate synthases, and bxn genes 
wh.ch encode nitrilase enzymes which degrade bromoxyr.il, the aasa gene which 
confers resistance to the antibiotic apectinomycin. the streptomycin phos- ' 
photransferase (SPT) gene, which permits resistance to streptomycin, the neo- 
mycin phosphotransferase (NPTII) gene, which confers resistance to kanamycin 
or geneticidin. the hygromycin phosphotransferase (HPT) gene, which confers 
resistance to hygromycin. the acetolactate synthase gene (ALS), which confers 
resistance to sulfonylurea herbicides (for example mutated ALS variants with for 
example, the S4 and/or Hra mutation). 

20 b) Reporter genes which encode readily quantifiable proteins and which allow the 
transformation efficacy or the expression site or time to be assessed via their 
color or enzyme activity. Very particularly preferred in this context are reporter 
proteins (Schenbom E, Groskreutz D. Mol Biotechnol. 1999; 13(1):29-44) such 
as the "green fluorescent protein" (GFP) (Sheen et al.(1995) Plant Journal 

25 8(5):777-784), chloramphenicol transferase, a luciferase (Ow et el. (1986) Sci- 

ence 234:856-859). the aequorin gene (Prasher et al. (1985) Biochem Biophys 
Res Commun 126(3):1259-1268). R^alactosidase. with glucuronidase being 
very particularly preferred (Jefferson et al. (1 987) EMBO J 6:3901-3907). 

30 c) Replication origins which allow replication of the expression cassettes or vectors 
accordmg to the invention in, for example. E.coli. Examples which may be men- 
tioned are OR! (origin of DNA replication), the pBR322 ori or the P15A orf (Sam- 
brook et al.: Molecular Cloning. A Laboratory Manual. 2"" ed. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1 989) 

35 

d) Elements which are required foragrobacterlum-mediated plant transformation 
such as, for example, the right or left border of the T-DNA. or the vir region. 

To select cells which have successfully undergone homologous recombination or else 
40 cells which have succesfully been transformed, it is generally required additionally to 
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introduce a selectable marker which confers resistance to a biocide (for example 
a herbicide), a metabolism inhibitor such as 2-deoxyglucose-6-phosphate 
(WO 98/45456) or an antibiotic to the cells which have successfully undergone 
recombination. The selection marker permits the selection of the transformed cells 
5 from untransformed cells (McCormick et al. (1986) Plant Cell Reports 5:81-84). 

In addition, said recombinant expression cassette or vectors may comprise further 
nucleic acid sequences which do not encode a nucleic acid sequence SEQ (D NO: 1, 
SEQ ID NO: 3 or SEQ ID NO: 5 and whose recombinant expression leads to a further 

1 0 increase In fatty acid biosynthesis. By way of example, but not by limitation, such a 
proOIL nucleic acid sequence which is additionally expressed recombinantly can be 
selected from among nucleic acids encoding acetyl-CoA carboxylase (ACCase), 
glycerol-3-phosphate acyltransferase (GPAT), lysophosphatidate acyitransferase 
(LPAT), diacylglycerol acyltransferase (DAGAT) and phospholipidrdlacylglycerol 

1 5 acyltransferase (PDAT). Such sequences are known to the skilled worker and are 
readily accessible from databases or suitable cDNA libraries of the respective plants. 

An expression cassette according to the invention can advantageously be introduced 
into an organism or cells, tissues, organs, parts or seeds thereof (preferably into plants 
20 or plant cells, tissues, organs, parts or seeds) by using vectors in which the recombi- 
nant expression cassettes are present. The invention therefore furthermore relates to 
said recombinant vectors which encompass a recombinant expression cassette for a 
nucleic acid sequence SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 5. 

25 For example, vectors may be plasmlds, cosmlds, phages, viruses or else agrobacteria. 
The expression cassette can be introduced into the vector (preferably a plasmid vector) 
via a suitable restriction cleavage site. The resulting vector is first introduced into E.coli. 
Correctly transformed E.coli are selected, grown, and the recombinant vector is 
obtained with methods known to the skilled worker. Restriction analysis and sequenc- 

30 ing may be used for verifying the cloning step. Preferred vectors are those which make 
possible stable integration of the expression cassette into the host genome. 

The invention furthermore relates to transgenic plant organisms or tissues, organs, 
parts, cells or propagation material thereof which comprise a SEQ ID NO: 1, SEQ ID 
35 NO: 3 or SEQ ID NO: 5 as defined above, a transgenic expression cassette for a 
SEQ ID NO: 1, SEQ ID NO: 3 or SEQ ID NO: 5 or a transgenic vector encompassing 
. such an expression cassette. 

Such a transgenic plant organism is generated, for example, by means of trans- 
40 formation or transfection of the corresponding proteins or nucleic acids. The generation 
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of a transformed organism (or a transformed cell or tissue) requires introducing the 
DNA in question (for example the expression vector), RNA or protein into the host cell 
in question. A multiplicity of methods is available for this procedure, which is termed 
trans-formation (or transduction or transection) (Keown et al. (1990) Methods in 
5 Enzymology 185:527-537). Thus, the DNA or RNA can be introduced for example 
directly by microinjection or by bombardment with DNA-coated mfcroparticles. The cell 
may also be permeabilized chemically, for example with polyethylene glycol so that the 
DNA may reach the cell by diffusion. The DNA can also be introduced by protoplast 
fusion with other DNA-comprising units such as mlnicells, cells, lysosomes or lipo- 

10 somes. Electroporation is a further suitable method for introducing DNA; here, the 
cells are permeabilized reverslbly by an electrical pulse. Soaking plant parts in DNA 
solutions, and pollen or pollen tube transformation, are also possible. Such methods 
have been described (for example in Bilang et al. (1991) Gene 100:247-250; Scheid 
et al. (1991) Mol Gen Genet 228:104-1 12; Guerche et al. (1987) Plant Science 521 1 1- 

15 1 16; Neuhause et al. (1987) Theor Appl Genet 75:30-36; Klein et al. (1987) Nature 
327:70-73; Howell et al. (1980) Science 208:1265; Horsch et al.(1985) Science 
227:1229-1231; DeBlock et al. (1989) Plant Physiology 91:694-701; Methods for Plant 
Molecular Biology (Weissbach and Weissbach, eds.) Academic Press Inc. (1988); and 
Methods in Plant Molecular Biology (Schuler and Ziellnski, eds.) Academic Press Inc. 

20 (1989)). 



In plants, the methods which have been described for transforming and regenerating 
plants from plant tissues or plant cells are exploited for transient or stable transforma- 
tion. Suitable methods are, in particular, protoplast transformation by polyethylene 
25 glycol-induced DNA uptake, the biolistic method with the gene gun, what is known as 
the particle bombardment method, electroporation, the incubation of dry embryos in 
DNA-containing solution, and microinjection. 

In addition to these "direct* transformation techniques, transformation may also be 
30 effected by bacterial infection by means of Agrobacterium tumefaciens or Agro 
bacterium rhizogenes and the transfer of corresponding recombinant Ti plasmids 
or Ri plasmids by infection with transgenic plant viruses. Agrobacterium-mediated 
transformation is best suited to cells of dicotyledonous plants. The methods are 
described, for example, in Horsch RB et al. (1985) Science 225- 1229f) 

35 

When agrobacteria are used, the expression cassette is to be integrated into specific 
plasmids, either into a shuttle vector or into a binary vector. If a Ti or Ri plasmid is to be 
used for the transformation, at least the right border, but in most cases the right and 
left border, of the Ti or Ri plasmid T-DNA is linked to the expression cassette to be 
40 introduced as flanking region. 
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Binary vectors are preferably used. Binary vectors are capable of replication both in 
E.col. and in Agrobacterium. As a rule, they contain a selection marker gene and a 
linker or polylinker flanked by the right and left T-DNA border sequence. They can be 
5 transformed directly into Agrobacterium (Holsters et al. (1978) Mol Gen Genet 

163:181-187). The selection marker gene, which is, for example, the nptll gene which 
confers resistance to kanamycin. permits a selection of transformed agrobacteria The 
Agrobacterium which acts as host organism in this case should already contain a 
plasmid with the vir region. The latter is required for transferring the T-DNA to the plant 

10 cells. An Agrobacterium transformed in this way can be used for transforming plant 
cells. The use of T-DNA for the transformation of plant cells has been studied inten- 
sively and described (EP 120 516; Hoekema. In: The Binary Plant Vector System 
Offsetdrukkerij Kanters B.V.. Alblasserdam. Chapter V; An et al. (1985) EMBO J 4-277- 
287). Various binary vectors, some ofwhich are commercially available, such as for 

15 example, P BI101.2 or P BIN19 (Ctontech Laboratories, Inc. USA), are known. ' 

Further promoters which are suitable for expression in plants have been described 
(Rogers et al. (1987) Meth in Enzymol 153:253-277; Schardl et al. (1987) Gene 
61:1-1 1; Berger et al. (1989) Proc Natl Acad Sci USA 86:8402-8406), 

Direct transformation techniques are suitable for any organism and cell type. In cases 
where DNA or RNA are injected or electroporated into plant cells, the plasmid used 
need not meet any particular requirements. Simple plasmids such as those from the 
puc series may be used. If intact plants are to be regenerated from the transformed 
25 cells, it is necessary for an additional selectable marker gene to be present on the 
plasmid. 



Stably transformed cells, i.e. those which contain the inserted DNA integrated into the 
DNA of the host cell, can be selected from untransformed cells when a selectable 

30 marker is part of the inserted DNA. By way of example, any gene which is capable of 
conferring resistance to antibiotics or herbicides (such as kanamycin, G 418, bleo- 
mycin, hygromycin or phosphlnothricin and the like) is capable of acting as marker (see 
above). Transformed cells which express such a marker gene are capable of surviving 
in the presence of concentrations of such an antibiotic or herbicide which kill an un- 

35 transformed wild type. Examples are mentioned above and preferably comprise the bar 
gene, which confers resistance to the herbicide phosphinothricin (Rathore KS et al. 
(1993) Plant Mol Biol 21(5):871-884), the nptll gene, which confers resistance to 
kanamycin, the hpt gene, which confers resistance to hygromycin, or the EPSP gene, 
which confers resistance to the herbicide Glyphosate. The selection marker permits 

40 selection of transformed cells from untransformed cells (Mccormick et al. (1 986) Plant 
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Cell Reports 5:81-84). The plants obtained can be bred and hybridized in the custom- 
ary manner. Two or more generations should be grown in order to ensure that the 
genomic integration is stable and hereditary. 

The above-described methods are described, for example, in Jenes B etaJ (igg3) 
Techniques for Gene Transfer, in: Transgenic Plants. Vol. 1, Engineering and Utiliza- 
tion, edited by SD Kung and R Wu, Academic Press, pp.128-143, and in Potrykus 
(1 991) Annu Rev Plant Physiol Plant Molec Biol 42:205-225). The construct to be 
expressed is preferably cloned into a vector which is suitable for transforming Agrobac- 
tenum tumefaciens. for example P Bin19 (Bevan et al. (1984) Nucl Acids Res 12:871 if). 

Once a transformed plant cell has been generated, an intact plant can be obtained 
using methods known to the skilled worker. For example, callus cultures are used as 
starting material. The development of shoot and root can be induced in this as vet 
undifferentiated cell biomass in the known fashion. The plantfets obtained can be 
planted out and used for breeding. 

The skilled worker is familiar with such methods for regenerating plant parts and intact 
plants from plant cells. Methods which can be used for this purpose are. for example 
20 those described by Fennelletal. (1992) Plant Cell Rep. 11: 567-570; Stoegeretal ' 
(1995) Plant Cell Rep. 14:273-278; Jahne et al. (1 994) Theor Appl Genet 89:525-533. 

Transgenic", for example in the case of an OEP, refers to a nucleic acid sequence an 
expression cassette or a vector comprising said OEP nucleic acid sequence or to an 
25 organism transformed with said nucleic acid sequence, expression cassette or vector 
or all those constructs established by recombinant methods in which either 

a) the nucleic acid sequence encoding an OEP or 

30 b) a genetic control sequence, for example a promoter which is functional In plant 
organisms, which Is linked operably with said nucleic acid sequence under a) 

are not in their natural genetic environment or have been modified by recombinant 
methods, it being possible for the modification to be, for example, a substitution, 

35 addition, deletion, inversion or insertion of one or more nucleotide residues. Natural 
genetic environment refers to the natural chromosomal locus in the source organism or 
the presence in a genomic library. In the case of a genomic library, the natural genetic 
environment of the nucleic acid sequence is preferably retained, at least to some 
extent The environment flanks the nucleic acid sequence at least on one side and has 

40 a sequence length of at least 50 bp, preferably at least 500 bp. particularly preferably 
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at least 1000 bp, very particularly preferably at least 6000 bp. A naturally occurring 
expression cassette, for example the naturally occurring combination of the promoter of 
a gene coding for an OEP with the corresponding OEP gene, becomes a transgenic 
expression cassette when the latter is modified by non-natural, synthetic ("artificial") 
methods such as, for example, a mutagenization. Such methods are described in 
US 5,565,350; WO 00/15815; see also above. 

Host or starting organisms which are preferred as transgenic organisms are. above 
all, plants in accordance with the above definition. Included for the purposes of the 
invention are all genera and species of higher and lower plants of the Plant Kingdom, 
In particular plants which are used for obtaining oils, such as, for example, oilseed 
rape, sunflower, sesame, safflower, olive tree, soya, maize, wheat and nut species. 
Furthermore included are the mature plants, seed, shoots and seedlings, and parts, 
propagation material and cultures, for example cell cultures, derived therefrom. Mature 
plants refers to plants at any desired developmental stage beyond the seedling stage. 
Seedling refers to a young, immature plant at an early developmental stage. 

The transgenic organisms can be generated with the above-described methods for the 
transformation or transfection of organisms. 

The invention furthermore relates to the use of the transgenic organisms according to 
the invention and to the cells, cell cultures, parts - such as, for example, in the case 
of transgenic plant organisms roots, leaves and the like - and transgenic propagation 
material such as seeds or fruits which are derived therefrom for the production of food- 
stuffs or feedstuffs, pharmaceuticals or fine chemicals, in particular oils, fats, fatty acids 
or derivatives of these. 

Besides influencing the oil content, the transgenic expression of SEQ ID NO : 1, 
SEQ ID NO : 3, SEQ ID NO : 5 or derivatives thereof in plants may mediate yet further 
advantageous effects such as, for example, an increased stress resistance. Such 
osmotic stress occurs for example in saline soils and water and is an Increasing 
problem in agriculture. Increased stress tolerance makes it possible, for ex-ample, to 
use areas in which conventional arable plants are not capable of thriving for agricultural 
usage. 

The invention now having been generally described will be more readily understood by 
reference to the following examples, which are included for the purpose of illustration 
only, and are not intended to limit scope of the present invention. 
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General methods: 

5 Unless otherwise specified, all chemicals were from Fluka (Buchs), Merck (Darmstadt). 
Roth (Karlsruhe), Serva (Heidelberg) and Sigma (Deisenhofen). Restriction enzymes, 
DNA-modifying enzymes and molecular biological kits were from Amersham-Phanma- 
cia (Freiburg), Biometra (Gdttingen), Roche (Mannheim). New England Biolabs 
(Schwalbach), Novagen (Madison, Wisconsin, USA), Perkin-Elmer (Weiterstadt), 
1 0 Qiagen (Hilden), Stratagen (Amsterdam. Netherlands), Invitrogen (Karlsruhe) and 
Ambion (Cambridgeshire, United Kingdom). The reagents used were employed in 
accordance with the manufacturer's instructions. 

For example, oligonucleotides can be synthesized chemically in the known manner 
15 using the phosphoamidite method (Voet, Voet, 2 nd edition, Wiley Press New York, 

pages 896-897). The cloning steps carried out for the purposes of the present invention 
such as, for example, restriction cleavages, agarose gel electrophoreses, purification 
of DNA fragments, transfer of nucleic acids to nitrocellulose and nylon membranes, 
linking DNA fragments, transformation of E. coii cells, bacterial cultures, multiplication 
20 of phages and sequence analysis of recombinant DNA. are carried out as decribed by 
Sambrook etal. (1989) Cold Spring Harbor Laboratory Press; ISBN 0-87969-309-6. 
Recombinant DNA molecules were sequenced using an ABI laser fluorescence DNA 
sequencer following the method of Sanger (Sanger et al. (1977) Proc Natl Acad Sci 
USA 74:5463-5467). 

25 

Example 1 

Decreased incorporation of [ 14 C]Glycerol 3-Phosphate into lipids of a yeast strain in 
which the YPR140w gene was disrupted. 

30 

A haploid yeast (Saccharomyces cerevisiae) strain, (EUROSCARF accession no. 
Y15555; EUROSCARF, University Frankfurt D-60439 Frankfurt Germany,) with 
the YPR140w gene deleted, and the corresponding wild type (wt) strain By4742 
(MA To his3A1, leu2AO, Iys2 AO, ura3A0), were acquired from the EUROSCARF yeast 

35 strain collection. Yeast cells were grown in iiquid YPD (1% yeast extract, 2% peptone 
and 2% glucose). For preparing microsomal membranes two yeast strains. Y15555 
and wt, were grown in 750 ml cultures in liquid YPD on rotary shaking at 30° C. The 
cultures were started by inoculation from overnight liquid-YPD pre-cultures to an ODboo 
of 0.05 and the cells were harvested at an OD m of about 1 .5, by centrifugation for five 

40 minutes at 3000rpm. The cell pellets were washed once in deionised water and frozen. 
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Cell pellets were thawed and washed with breaking buffer (0.6 M Mannitol, 20mM Tris- 
HCI pH 7.6, ImM EDTA, 1ug/ml Aprotinin, 0.7ug/ml Pepstatin. 0.5 ug/ml Leupeptin) 
before being centrifuged for five minutes at 30000 rpm. The cells were then re- 
suspended in breaking buffer (one ml of breaking buffer per g of wet cell pellet) and 
then broken by passing them 2-3 times through a French Press (Ammco SLM instru- 
ments, Urbane lllinios). The cells were diluted 1.5 times with breaking buffer and 
centrifuged at 1000g for five minutes at 4°C. The supernatant was then centrifuged 
two additional times, first at 5000g for 10 minutes then at 100,000 g for sixty minutes 
both at 4°C. The microsomal pellet was re-suspended in 20mM Tris-HCI, pH 7.6 to a 
protein concentration of approximately 10mg/ml and homogenized with glass homoge- 
nizer. Microsomes were stored frozen at-70°C. Protein concentrations were deter- 
mined using the BCA method (Pierce) with BSA as standard protein. The enzymatic 
incorporation of [ 14 C]glycerol-3-phosphate were analysed by adding 50pl of an reaction 
mixture (50mM Tris-HCI pH 7.5, 8 mM NaF, 4mM MgCI 2 , 0.4 mM 10000dpm/nmol 
t 14 C]-G3P, 4 mg/ml BSA, 0.2mM 16:0-CoA, 2 mM DTT) to 5ul of microsomal prepara- 
tions in a final volume of 100 ul. After incubation at 30°C for 10 min the reactions were 
stopped by adding 400ul of methanol/chloroform/ glacial acetic acid (50:50:1 ) and 
100ul of water followed by thoroughly shaking and centrifugation for twenty seconds at 
13000 rpm. The bottom chloroform layer was removed and transferred into scintillation 
vials containing 4 ml of scintillation fluid (ethanol/ethanol 2:1, 0.4% 2(1-butlphenol)-5- 
(4-biph-enyl)-1 , 3, 4-oxadizole). Radioactivity counting was carried out in a LKB 
Wallace 1290 Rackbeta Liquid scintillation counter. 

The effect of deleting the YPR140w gene In yeast was studied on the in vitro biosyn- 
thesis of lipids by analysing the incorporation rate of [ 14 C]glycerol-3phosphate into 
lipids. The glycerol-3-phospate acyltransferase (GPAT) reaction is the first step in the 
lipid biosynthesis in yeast, both for phospholipids and triacylglycerols. The lysophos- 
phatidic acid formed by GPAT activity can be further metabolised into phosphatidic acid 
and diacylglycerol by other enzymes which are part of the so called Kennedy pathway 
which are also present in the microsomal membrane fraction. The method used in this 
experiment to analyse the incorporation of [ 14 C]glycerol-3phosphate into lipids, is 
taking advantage of the fact that the labelled substrate is water soluble while the 
products are lipid soluble. This means that a simple chloroform extraction wjth scintilla- 
tion counting of the chloroform phase will give the amount of incorporated [ t4 C]glycerol- 
3phos-phate into lipids, which Is the same as the GPAT activity as all labelled lipid 
products formed has to pass the GPAT acylation step. The YPRf40w-deletion and wild 
type yeast strains were grown in liquid cultures and microsomal membrane fractions 
were prepared from harvested cell pellets. Removing the gene had little or no effect on 
the growth rate as determined by optical density measurements. However, the 
incorporation of [ ,4 C]glycerol-3phosphate into lipids when microsomal membranes 
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were incubated in the GPAT assay was clearly reduced to about 60% of wild type 
figure 1. These results indicate that the YPRUOwgene might encode a GPAT era 
or another type of protein that affects microsomal GPAT activity indirectly. 

5 Example 2 



Increased incorporation of [« CJGIycerol 3-Phosphate into lipids in crude extracts 
of wt yeast over expressing the gene YPR140W. 

10 For induced high-level expression of the YPR U0w gene, a 1146 bp DNA frag- 
ment as described in SEQ ID NO: 1 , from start to stop codon, was amplified from 
Swevisiae W303 genomic DNA by using a 1:1 mixture of Taq and pfu DNA poly- 
merases with the 5' primer, ATGTCTTTTAGGGATGTCCTAGA, and the 3' primer 
TCAATCATCCTTACCCTTTGGTT. The resulting PCR product was gei purified ' 

15 incubated with Taq polymerase and TA-cloned into the pCR2.l-TOPO cloning vector 
(Invitrogen). The YPR140w gene was then excised from the cloning vector by EcoRI 
d«gestion and cloned into the EcoRI behind the strong inducible GAL1 promoter in the 
multicopy plasmid pYES2 (Invitrogen). thus generating the plasmid pAN3. The wt yeast 
straw By4742 (MA 7a hls3A 1, leu2 AO, fys2 AO, urs3 AO), was transformed with pAN3 

20 and pYES2 using standard protocol. Overnight precultures of the two yeast transfor- 
mants, wt yeast containing pAN3 or pYES2 f were grown in 15 ml of synthetic liquid 
media lacking uracil and supplemented with 2 % Glucose (vol/vol). All yeast cultures 
were grown on rotary shaking at 30°C. The precultures were used to inoculate sixty ml 
of synthetic liquid media lacking uracil and supplemented with 3% glycerol and 3% 

25 lactate to a start OD^ of 0.2. The sixty ml cultures were grown for forty-eight hours 
(ODeoo about 1.5) and then the GAL1 over expression promoter was induced by the 
addition of galactose to a final concentration of 2% (vol/Vol) and the cultures were 
shaken for a further 6 hours at 30°C Cells were harvested (ODsoo about 2.5) by 
centrifugation five minutes at 3000rpm. Pellets were washed once with water and 

30 frozen. Cell pellets were thawed, resuspended in ice-cold breaking buffer (20mM Tris- 
HCi pH 7.6. 1mM EDTA, 1pg/ml Aprotinin, 0.7pg/ml Pepstatin, 0.5 ug/ml Leupeptin)to 
1,5 ml and added to two ml screw cap tubes containing 1 ml of acid washed glass 
beads (diameter 0.45-0.5 mm). The tubes were shaken at 4°C for 3 minutes in a 
M?nibeadbeater-8 (Techtum lab and then centrifuged for 10 minutes at 3000 rpm. The 

35 supernatant was harvested and stored at -70 °c. Protein concentrations were deter- 
mined using the BCA-method (Pierce) with BSA as protein standard. Enzymatic assays 
were run for ten minutes at 30°C The samples (I00pg of protein) were diluted to 50 pi 
with water and the assay was started by adding 50pl of assay mixture (50mM Tris-HCI 
pH 7.5. 8 mM NaF, 4mM MgCI 2 . 0.4 mM 10000dpm/nmol r; 4 C>G-3-P, 4 mg/ml BSA, 

40 0.2mM 16:0-CoA, 2 mM DTT). The assays were stopped by adding 400 pi of metharv 



BASF Plant Science GmbH 



20020908 Ll'L\ ••KJV9W 



ol/ch(oroform/ gladal acetic acid (50:50:1) and 100ui of water foltowed by thoroughly 
shaking and centrifugaBon for twenty seconds at 13000 rpm. The bottom chloroform 
layer was removed and transferred into scintillation vials containing 4 ml of scintillation 
fluid (ethancl/ethanol 2:1, 0.4% 2(1-butlphenol)-5.<4-biphen y l)-1. 3. 4-oxa-dizole) 
Radioactivity counting was carried out in a LKB Wallace 1290 Rackbeta Liouid 
scintillation counter. 
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The effect of over expression of the YPR^Ow gene was studied on the rate of in vitro 
incorporation of ["qglycerol-Sphosphate into lipids by crude cellular extracts Over 
expression of the YPR 140* gene was achieved by cloning the gene behind the strong 
galactose inducible GAL1 promoter and transforming the generated plasmid P AN3 
into wt yeast. The over expression plasmid, pYES2, without any gene behind the GAL1 
promoter was used as an "empty plasmid" control in the experiment Incorporation 
of glycerophosphate into lipids is catalyzed by the GPAT (glycerol 3-phosphate 
acyltransferase) enzyme, which utilizes acyl-CoA as acyldonor for the reaction and 
gives lysophosphatidic acid as product The GPAT reaction is the first step in the lipid 
biosynthesis in yeast, both for phospholipids and triacylglycerols. The formed lyso- 
phosphatidic add can be further metabolized into phosphatide acid and diacylglycerol 
by other enzymes, which are also present in the crude cellular extracts and are part of 
the so-called Kennedy pathway. The method used Sn this experiment to analyze the 
GPAT activity, or the incorporation of [ 14 C]glycerol-3phosphate Into lipids, is taking 
advantage of the fact that the labeled substrate is water-soluble while the products 
are lipid soluble, and that all labeled lipid products formed have to pass the GPAT 
step. This means that a simple chloroform extraction with scintillation counting of the 
25 chloroform phase will give the amount of incorporated [ 14 CJglycerol-3phosphate into 
lipids, which is the same as the GPAT activity. 

Induced high-level over expression of the YPR140 gene SEQ ID NO: 1 from the pAN3 
plasmid in wt yeast increased the incorporation of [ 14 C]glycerol-3phosphate into lipids 

30 2.5 fold as compared to empty plasmid control, see figure 2. The increase in incorpora- 
tion of [ C]glycerol-3phosphate into lipids is an effect of increased GPAT activity, but 
this does not mean that theYPRUOw gene product necessarily has to be a GPAT. 
The in vitro assay is done in a crude extract containing a very complex mixture of 
enzymes and proteins. Over expression of YPRUOw SEQ ID NO: 1 in wt yeast leads 

35 to an increased GPAT activity. 
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Trracylglycerol accumulation in yeast cells expressing the YPR140w gene 

For Induced high-level expression of the YPKf40wgene, a 1146 bp DMA fraa- 
ment as described in SEQ ID NO: 1, from start to stop codons, was amplified from 
S. cerews/ae W303 genomic DNA by using a 1:1 mixture of rag and pft, DNA poly- 
merases with the 5' primer, ATGTCTTTTAGGGATGTCCTAGA. and the 3' primer 
TCAATCATCCTTACCCTTTGGTT. Tbe resulting PGR product was gel purged 
mcubated w,th Taq polymerase and TA-cloned into the pCR2.1-TOPO cloning vector 
Onvt^gen). The YPR14C* gene was then excised from the cloning vector by 
d.gestjon and cloned into the EcoRI behind the strong inducible GAL1 promoter in the 
mufocopy plasmid P YES2 (Invitrogen). thus generaflng the plasmid P AN3. The ^ 
hype yeast stra,n By4742 (MATahis3A1, ,eu2AO, ,ys2A0. ^transformed with 

C K U,tlVatSd 3t 30 "° 00 3 r ° tary Shaker - ^thetic medium (ShTrln F 
et ah. ( 986) Laboratory Course Manual for Methods in Yeast Genetics. Cold Spring 

"^° r , ; IT, ' RainVfeW ' NY ' > ,aCWn9 Uraci ' and s "PP'emented with 2 % (vol/Vol) 
gfycerol and 2 % (vol/vol) ethanol. The GAL1 promoter was induced after 24 hours of 
growth by the addition of 2 % (wt/vol) final concentration of galactose. Cells were 
harvested after an additional 24 or 28 hours of growth. Wild type cells By4742 
transformed with the empty vector ( P YES2) and cultivated under identical conditions 
were used as a control. ' 

In order to quantify the total lipid content, 3 x 5 ml aliquots from yeast cultures were 
harvested by centrifugation, and the resulting pellets were washed with distilled water 
and lyophillsed. The weight of the dried ceils was determined, and the fatty acid con- 
tent was quantified by conventional gas-liquid chromatography (GLC) analyses after 
6487^6492) (DMqm * ^ (2 °° 0) PrOC ' NaV - Aca * Sci. USA 97, 

The lipid content was then calculated as nmol fatty adds per mg dry weight. The lipid 
composition of the yeast was determined in cells harvested from 35-ml liquid cultures 
The harvested yeast cells were re-suspended in 15 ml glass tubes in water to a final ' 
volume of 0.6 ml, to which 3.75 ml chloroform: methanol (1:2), 50 pi acetic acid and 
2 ml of glass beads (0.45 - 0.50 mm) were added. The yeast cells were disrupted by 
vigorous agitation (5 x 1 min) and the lipids were extracted into chloroform according 
to standard method (Bligh, E.G. and Dyer. W.J., Can. J. Biochem. Physiol. 37(1959), 
91 1-917). The collected lipid fraction was divided in two parts and separated by TLC 
on Silica Gel 60 plates (Merck) In hexane / diethyl ether / acetic acid (70:30:1) for the 
quantification of neutral lipids, i.e. unesterified fatty acids (FA), diacylglycerols (DAG) 
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Phabdytethanolamine <P 6 >. vJZ^tSZZSTT m " d Ph0S " 
5 and identified by means J^SZ^ ^exposure «o , 2 vapors 

excised material at 85"C for 60 mln in iv »w« . • preparea "* "eabng the 
methyt estere were extmcted arZ^mit! It ' ■* h d,y me,haTOI - ™» 
(GLC) ana,yses as I SSS j£T ^ "-*»" 

Ttie effect of high-level expression of the YWMOwoene. <:cr> .r. . 
accumulation was studied by transfom.inn ZlZl ° ' D NQ 1on 
pAN3 P lasmid contain^ ate '£££2?, T** *"* ** By4742 wth *• 
promoter. The Z^ ^T^T^Tf ^^'^ 
IS did not retare the growth rate as ^ ^I k " W% ' m fr <™ this promoter 
expression ofths C^rjTSX^r'r^T^- 
after an additional 24 hours (Exp #1 ) or at^ZZ^jL ? b """■•'•a 
content was determined as nm^SL^L ^ *> 0,cu " iva,,0 ''• T» total lipid 
signficantlyhlgherin cells ^^1^ ^""^^ 

20 rrrr: 1 - ^r^r .rof^e^rr^, 

transformants harvested 24 hours after Induction (Exp #i in Fto aHZT. ^ * 
compared to centre, ceils (pYES2^ Similarly me PpW contends 2 1 k ' 2 
in Fig. 3) when harvested 28 hours after *JL*ZZ - ' 9her (Exp * 2 

Furthermore, no major -.^.SSSKlE™* C ° n ' M ^ 
25 over expression of the YPRUOw «.„« tT . com P° !s '«'°'' occurred by the 
examined and «. £Z£%Z£^££ *>" ^ - 

glycerol, is strengty elevated in oe«s oTr «pre^ £ taSE^a WaCyl " 
pared to control cells (tabled th. , , 7 press,n « me v ' D '"'»0«r(pAN3)ascom- 

30 compared to the empty vector control. In cX na^ 24 ^1' T % ' 
<*<P*1 in table 3, the tnacylgtycero, o^ JZZSf^Z^ 
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Lipid content 
(nmol FA per mg dry weight) 


Polar lipids 


DAG 


FA 


TAG 


SE 


EXD#1 


Exp#2 


Exp#1 


Exp#2 


Exp#1 


Exp#2 


Exp#1 


Exp#2 


Exp#1 


Exp#2 


PYES2 


81 


99 


7.2 


7.8 


6.6 


6.0 


92 


104 


4.9 


5,6 


pAN3 


72 


67 


9.7 


5.0 


6.3 


1.3 


130 


169 


5.9 


7.0 



Table 3. 



5 The lipid content iii yeast transformants. Yeast cells, were transformed with the pAN3 
vector expressing the YPR140w SEQ ID NO: 1 from the GAL1 inducible promoter, and 
cells trans-formed with empty vector (pYES2) were used as control. The content of 
diacylglycerol (DAG), unesterified fatty acids (FA), triacylglycerol (TAG), and steryl 
esters (SE) was determined in yeast cells harvested 24 hours after induction and 
1 0 calculated as nmol fatty acid (FA) per mg dry cell weight 

A significant decrease in the overall polar lipid content was also observed due to the 
elevated expression of the YPR140w gene. The polar lipid content per dry cell weight 
decreased with up to 32% (table 4). Moreover, the relative distribution of the major 
15 polar lipid species as presented in table 1 differs between the overexpressor and the 
control cells. The relative content of phosphatidylinositol (PI) and phosphatidylethanol- 
amine (PE) is decreased whereas phosphatidylcholine (PC) and phosphatidylserine 
(PS) is increased in the YPR140w transfbrmant. 





Relative content of polar lipids 
(%) 


PI 


PS 


PC 


PE 


PYES2 


10,6 


10,3 


47,0 


32,1 


pAN3 


4,8 


12.0 


59,1 


24,1 



Table 4. 

The relative composition (%) of phosphatidylinositol (PI), phosphatidylserine (PS), 
phosphatidylcholine (PC) and phosphatidylethanolamine (PE) in the different trans- 
25 formants harvested after 28 hours after induction. Yeast cells were transformed with 
the pAN3 vector expressing the YPRUOw SEQ ID NO: 1 from the GAL1 inducible 
promoter. Yeast cells transformed with empty vector (pYES2) were used as control. 



BASF Plant Science GmbH 



20020908 LI' LI ' J<JV9l8H 8ZS 9uqddi3 



38 

In summary, the triacylglycerol content in yeast cells expressing YPR140w SEQ ID 
NO: 1 from a strong promoter was increased with 61 % as compared to the control 
which demonstrates the potential use of these genes in increasing the oil content in 
transgenic organisms. 

5 

Example 4 

Identification of genes homologous to YPR140w in plants. 

10 To identify possible homologues to the YPR140w yeast gene SEQ ID NO: 1, the NCBI 
database were searched for related plant sequences. From these BLAST searches two 
protein sequences In Arabldopsis thatiana were identified with 31,6 % (At140.1) and 
26.3% (At140,2) sequence identity to the deduced amino acid sequence of the open 
reading frame YPRUQw using the GAP programme. From an EST database the full- 

15 length sequences of the corresponding cDNAs were identified and named At140. 1 and 
At140.2. The Arabldopsis AX140. 1 and AH40.2 genes were then PCR amplified from an 
Arabidopsis seedling cDNA library. The AM40.1 gene was amplified from the cDNA by 
Pfu Turbo DNA polymerase (Stratagene, USA) with the use of the primers AM40.1-S1: 
GTCGGTCTTTCTAACTGAATC and At140.1-A1:CCTGTGGGACTTAAACCTCA. 

20 The PCR product was diluted 10 times and used as a template in a second PCR 
with internal primers At140.1-S2:CAGAATGGGAATTCATTTTG and AM40.1-A2: 
CTAACGGGAGTTTAACTTGCA, Similarly, the At140.2 gene was amplified from the 
cDNA by using the primers At140.2-S1: CTGGTCTCGTTTCTAATTG and At140.2-A1: 
CATGGCGAATCTAAACCGGAAC. The resulting DNA fragments were purified and 

25 cloned into pCR-Blunt ll-TOPO (Invitrogen life technologies, USA), generating pCR- 
AM40.1 and pCR-At140.2, respectively. The genes were verified by sequencing, see 
SEQ ID NO : 3 and SEQ ID NO : 5. 

Example 5 

30 

Triacylglycerol accumulation in yeast cells expressing the plant genes At 140* 1 or 
AH40.2 

The At140. 1 and the At140.2 genes SEQ ID NO: 3 and SEQ ID NO: 5 were excised 
35 from the cloning vectors pCR-At140.1 and pCR-At140.2 by BamHI and Notl digestions, 
respectively. The excised genes were then subcloned into the BamHI and Notl sites, 
respectively, behind the strong inducible GAL1 promoter in the multicopy plasmid 
pYES2 (Invitrogen, USA), thus generating the plasmids pY-At140.1 and pY-At140.2, 
respectively. The wild type yeast strain By4742 (MA fa his3A1, Ieu2 AO, Iys2 AO, 
40 ura3 A0) $ transformed with these plasmids were cultivated at 30 °C on a rotary shaker 
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in synthetic medium (Sherman et al. r 1986) lacking uracil and supplemented with 2 % 
(vol/vol) glycerol and 2 % (vcl/vol) ethanoL The GAL1 promoter was induced after 24 
hours of growth by the addition of 2 % (wt/voi) final concentration of galactose. Cells 
were harvested after an additional 24 hours of growth. Wild type cells By4742 trans- 
5 formed with the empty vector, pYES2 and cultivated under identical conditions were 
used as a control. The lipid content of the yeast cells was determined as described in 
Example 3. 

The effect of high-level expression of the At140. 1 (SEQ ID NO: 3) or the AI140.2 gene 

10 (SEQ ID NO: 5) on lipid accumulation was studied in yeast transformants expressing 
the At140. 1 or the AH40.2 gene, respectively, under control of the galactose-induced 
GAL1 promoter. The gene expression was induced after 24 hours and cells were 
harvested after an additional 24 hours of cultivation. High-level expression of these 
genes showed only minor effects on the growth rate as determined by optical density 

1 5 measurements. However, the total lipid content, determined as nmol fatty acids (FA) 
per mg yeast, was strongly elevated in the At140. 1 or the AH40.2 transformants as 
compared to cells transformed with an empty vector. The total lipid content (Fig. 4) of 
the AH40. 1 (pY-At140.1) and AH40.2 (pY-AtT40.2) transformants was 42 % and 45% 
higher, respectively, as compared to control cells (pYES2). The increased lipid content 

20 is mainly explained by the strongly elevated triacytglycerol content in the At140. 1 and 
the AH40.2 transformants (table 5). The triacytglycerol content in these transformants 
was 79 % and 92 % higher, respectively, as compared to the empty vector control. 
The polar lipid content was slightly lowered and contents of unesterified fatty acids 
and diacylglycerois were increased whereas the steryt ester content was not affected 

25 by the over expression of the Arabidopsis genes, AH 40. 1 and AH40.2. 





Lipid content 
(nmol FA per mg dry weight) 


Polar lipids 


DAG 


FA 


TAG 


SE 


PYES2 


83 


5.0 


3.6 


73 


6.7 


PY-AM40.1 


75 


9.1 


21 


131 


7.3 


PY-A1140.2 


70 


8.8 


24 


140 


6.8 



Table 5. 



30 The lipid content in yeast transformants. Yeast cells, were transformed with the pY- 
At140.1 or the pY-140.2 vector expressing the AH 40.1 or the AH40.2 from the GAL1 
inducible promoter. Yeast cells transformed with empty vector (pYES2) were, used 
as control. The content of diacylglycerol (DAG), unesterified fatty acids (FA), triacyl- 
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glycerol (TAG), and steryl esters (SE) was determined in yeast cells harvested 

24 hours after induction and calculated as nmol fatty acid (FA) per mg dry cell weight. 

The major polar lipids in yeast are phosphatidylinositol (PI), phosphatidylserine (PS), 
5 phosphatidylcholine (PC) and phosphabdylethanolamine (PE). The effects on the 
relative distribution of these, major polar lipid classes by the over expression of the 
At140. 1 or the Atl 40.2 gene are presented in table 6. The most significant effect was 
observed in the phosphatidylinositol (PI) content of the Atl 40. 1 or the Af 740.2 trans- 
formants, with a 39% and a 33% decrease in the relative content as compared to 
10 control cells. 





Relative content of polar lipids 
(%) 


PI 


PS 


PC 


PE 


pYES2 


11.0 


12.2 


54.3 


29.7 


pY-At140.1 


6.7 


11.1 


48.7 


33.4 " 


PY-AM40.2 


7.4 


11.9 


47.5 


33.2 



Table 6. 



1 5 The relative composition (%) of phosphatidylinositol (PI), phosphatidylserine (PS), 
phosphatidylcholine (PC) and phosphatidylethanolamine (PE) in the different trans- 
formants harvested after 24 hours after induction. Yeast cells were transformed with 
the pY-At140. 1 or the pY-At140.2 vector expressing the At140. 1 or the AtUO.2 gene, 
respectively, from the GAL1 inducible promoter. Yeast cells transformed with empty 

20 vector (pYES2) were used as control. 

The effect on the fatty acid profile by the over expression of the Af f 40. 1 or the At140.2 
gene were also analysed. Only minor changes in response to the over expression of 
these genes on the fatty acid composition of the polar lipids were observed (data not 
25 shown). However, the levels of oleic (18:1) acid increased and the palmitic (16:0) 

decreased in triacylglycerols accumulated in yeast cells expressing the At1 40. 1 or the 
At 140.2 gene (table 7). A minor increase in the stearic acid (18:0) content was also 
observed. 
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Relative fatty acid composition in triacylglycerol 

(%) 


16:0 


16:1 


18:0 


18:1 


pYES2 


30.5 


31.5 


13.2 


24.8 


pYnAtI40.1 


14.5 


31.3 


15.4 


38.3 


pY-At140.2 


14.2 


31.4 


15.0 


39.3 



Table 7. 



The fatty acid composition of triacyiglycerofs in yeast cells transformed with the pY- 
5 At140.l or the pY-At140.2 vector expressing the At 140. 1 or the At140.2 gene, respec- 
tively, from the GAL1 inducible promoter. Yeast ceils transformed with empty vector 
(pYES2) were used as control. Major fatty acids present in yeast are palmitic (16:0), 
palmitoleic (16:1), stearic (18:0), and oleic (18:1) acid. 

10 In summary, the triacylglycerol content in yeast cells expressing, the At 140. 1 (SEQ ID 
NO: 3) or the At140.2 gene (SEQ ID NO: 5) from a strong promoter was increased with 
78 or 92 % as compared to the control, which demonstrates the potential use of these 
genes in increasing the oil content in transgenic organisms. 

15 Example 6 

Transgenic plants expressing the At 140. 1 orAt140.2 gene 

Cloning and analysis of oil content in plant: For induced high-level expression of the 
20 At1 40.1 (SEQ ID NO: 3) orAt140.2 gene (SEQ ID NO: 5) in plants, the genes were 
cloned into a binary vector system under the control of either a seed-specific or 
a constitutive promoter. An Agroftacfer/um-mediated plant transformation binary vector 
system was applied, including the primary cloning vector pART7-35S with a CaMV35S 
promoter or pART7-Nap with the seed specific napin promoter from Brassica napus 
25 (Josefsson, L 6. et al., J. Biol. Chenru 262 (1987). 12196-12201) coupled with a 
transcriptional terminator sequence and a binary Ti vector pGII0229 (Hellens, R.P. 
et al., Plant Mol. BioL 42 (2000), 819-832). A DNA fragment, containing gene AM40.1, 
was excised by BamHI and EcoRVfrom the cloning vector pCR-At140.1 (described in 
example 4) and was then subcloned into BamHI and Smal treated pART7-35S or 
30 pART7-Nap, generating p35S-At140.1 and pNap-At140.1, respectively. From these 
plasmids the expression cassettes containing the promoter, the At140.1 gene and the 
transcriptional terminator sequence was excised by Notl cleavage. The fragments were 
then inserted into the Notl site of the pGH0229 plasmid, generating the vectors pGII- 
35S-At140.1 and pGII-Nap-At140.1, respectively. 
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Plant expression vectors with the gene At140.2 expressed from a constitutive 
CaMV35S or from a seed specific napin promoter were constructed. First the Notl site 
was first deleted from the pCR-At140.2 by cutting with Not1 , filling and re-ligation. 
5 Thereafter the AtUO.2 gene was released by Hindlll and Xbal and inserted into the 
corresponding sites of the plasmids pART7-35S or pART7-Nap, generating plasmids 
p35S-At140.2 and pNap-At140.2, respectively. The expression cassettes were then 
excised from these plasmids and introduced into Notl site of pGH0229, generating 
plasmids pGII-35S-At140.2 and pGII-Nap-At140.2, respectively. 

10 

Using floral dip essentially as described by Clough and Bent, 1998, plants were 
transformed with Agrobacterium tumefaciens GV3101 harboring the plasmid pGII-35S- 
AM40.1, pGII-Nap-At140.1, pGII-35S-At140.2 and pGII-Nap-At140.2 respectively. 
Entire plants (inflorescence and rosette) were submerged for 20 - 30 sec In the 

1 5 infiltration media consisting of 5% sucrose, 0.02% Silwet L-77 (Osi Specialties. 

Danbury, CT) and re-suspended transformed A. tumefasciens cells. Plants were then 
transferred to a growth chamber with a photoperiod of 16 h of light at 21°C and 8 h 
of dark at 18°C (70% humidity). The T1 seeds were collected from mature plants. 
Subsequently transformed plants were identified on selection media by growing T1 

20 seeds on MS-agar plates supplemented with BASTA (gulfosinate ammonium) 1 5 mg/l, 
cefotaxime 50 mg/L Alternatively T1 plants were grown on soil trays and sprayed three 
times at an interval of two days with 0.4% BASTA. 

Plant lipids can be extracted from plant material as described by Cahoon et al. (1999, 
25 Proc. Natl. Acad. Sci. USA 96, 22:12935-12940) and Browse et al. (1986, Anal. 

Biochemistry 442:141-145). Qualitative and quantitative lipid or fatty acid analysis is 
described in Christie, William W., Advances in Lipid Methodology. Ayr/Scotland :Oily 
Press. - (Oily Press Lipid Library; Christie. William W. ( Gas Chromatography and 
Lipids. A Practical Guide - Ayr, Scotland :Oily Press, 1989 Repr. 1992 - IX 307) 

30 

The oil content of T2 seeds of the Arabidopsis transformants was analyzed by the use 
of conventional gas-liquid chromatography (GLC) and calculated as mg fatty acids per 
mg dry seeds (Fig. 5). As control, seeds from wild type plants were used. The over 
expression of the AI140.1 (SEQ ID NO: 3) or the At140.2 gene (SEQ ID NO: 5) does 
35 result in an increase in total lipid content 
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We claim: 

1 . A method of increasing the total oil content in a plant organism or a tissue, organ, 
part ceKor propagation material thereof, comprising 

5 

a) the transgenic expression of a polypeptide SEQ ID NO : 2, SEQ ID NO : 4 
or SEQ ID NO ; 6 in said plant organism or In a tissue, organ, part, cell or 
propagation material thereof, and 

10 b) the selection of plant organisms in which - in contrast to or comparison with 

the starting organism - the total oil content in said plant organism or in a tis- 
sue, organ, part, cell or propagation material thereof is increased. 

2. The method as claimed in claim 1, wherein the oil biosynthesis enhancing protein 
15 is encoded by a nucleic acid sequence selected from the group consisting of: 

a) a nucleic acid sequence comprising a nucleotide sequence which is at 
least 60% identical to the nucleic acid sequence of SEQ ID NO : 1 , 
SEQ ID NO : 3 or SEQ ID NO : 5; 

20 

b) a nucleic acid sequence comprising a fragment of at least 30 nucleotides 
of a nucleic acid sequence comprising the nucleotide sequence of 
SEQ ID NO :1, SEQ ID NO : 3 or SEQ ID NO : 5; 

25 c) a nucleic acid sequence which encodes a polypeptide comprising an amino 

acid sequence at least about 60% identical to the amino acid sequence of 
SEQ ID NO : 2, SEQ ID NO ; 4 or SEQ ID NO : 6 and 

d) a nucleic acid sequence which encodes a fragment of a polypeptide com- 
30 prising the amino acid sequence of SEQ ID NO : 2, SEQ ID NO : 4 or 

SEQ ID NO : 6 or wherein the fragment comprises at least 10 contiguous 
amino acid residues of the amino acid sequence of SEQ ID NO : 2, 
SEQ ID NO : 4 or SEQ ID NO : 6. 

35 3. A method as claimed in claim 1 or 2, wherein the plant is an oil crop. 



4. A method as claimed in claim 1 or 2, wherein the total oil content in the seed of a 
plant is increased. 
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5. An expression cassette comprising in combination with a regulatory sequence a 
nucleic add sequence selected from the group consisting of: 

a) a nucleic acid sequence comprising a nucleotide sequence which is at least 
5 60% identical to the nucleotide sequence of SEQ ID NO : 1 , SEQ ID NO : 3 

or SEQ ID NO : 5, 

b) a nucleic acid sequence comprising a fragment of at least 30 nucleotides 
of a nucleic acid sequence comprising the nucleotide sequence of SEQ ID 

10 NO :1, SEQ ID NO: 3 or SEQ ID NO: 5, 

c) a nucleic acid sequence which encodes a polypeptide comprising an 
amino acid sequence at least about 60% identical to the amino acid 
sequence of SEQ ID NO : 2, SEQ ID NO : 4 or SEQ ID NO : 6, or 

15 

d) a nucleic acid sequence which encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO : 2, SEQ ID 
NO : 4 or SEQ ID NO : 6 wherein the fragment comprises at least 
10 contiguous amino acid residues of the amino acid sequence of 

20 SEQ ID NO : 2, SEQ ID NO : 4 or SEQ ID NO : 6 

wherein said regulatory sequence is capable of mediating expression of said nu- 
cleic acid sequence in a plant. 

25 6. An expression cassette according to claim 5. wherein said nucleic acid se- 
quence encodes a polypeptide comprising the amino acid sequence set forth in 
SEQ ID NO : 2. SEQ ID NO : 4 or SEQ ID NO : 6. 

7. An expression cassette as claimed in claim 5 or 6, wherein the promoter is a 
30 seed-specific promotor. 

8. A genetically modified plant organism or tissue, organ, part, cell or propagation 
material thereof, comprising a polypeptide as defined in SEQ ID NO : 2, 

SEQ ID NO 4 or SEQ ID NO 6 or an expression cassette as claimed in any of 
35 claims 5 to 7. 
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9. A genetically modified plant organism as claimed in claim 8, wherein the plant 
organism is selected from the group of the oil crops consisting of Borvago offici- 
nalis. Brassica campestris, Brassica napus, Brassica rapa, Cannabis sativa, Car- 
thamus tinctorius, Cocos nucifera. Crambe abysslnica, Cuphea species. Elaeis 
5 guinensis, Elaeis oleifera, Glycine max, Gossypium hirsutum, Gossypium bar- 

badense, Gossypium herbaceum, Helianthus annuus. Unum usitatissimum, Oe- 
nothera biennis, Olea europaea. Oryza sativa. Ricinus communis. Sesamum in- 
dicum, Triticum species, Zea mays, walnut and almond. 

10 10. The use of a genetically modified plant organism or tissue, organ, part, cell or 
propagation material thereof as claimed in claim 8 or 9 for the production of oils, 
fats, free fatty adds or derivatives of the above. 
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Use of genes for increasing the oil content in plants 
Abstract 

5 The invention relates to methods for increasing the oil content in plants, preferably in 
plant seeds, by expressing the Ypii40w polypeptide from yeast or corresponding poly- 
peptides from plants. The invention furthermore relates to expression constructs for 
expressing the yeast polypeptide Ypr140w or corresponding polypeptides from plants 
in plants, preferably in plant seeds, the transgenic plants expressing the polypeptide 
10 and to the use of said transgenic plants for the production of food, feed, seed, pharma- 
ceuticals or fine chemicals, in particular for the production of oils. 
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SEQUENCE LISTING 

<:ilo> BASF Plane Science QribH 

5 <120> Use of genes for increasing the oil content: in plants 

<130* PF54384 

<140> AB20020908 
10 <141> 2003-04-14 

<160> 35 

<170> Patentln Ver* 2.1 

15 

<210> 1 

<2XX> 1X46 

<r212> DMA 

<213=> S ac charoray ces cerevisiae 

20 

<i220> 

<221> CDS 

<222s> (1)-.<114£) 

25 <400> 1 

atg tct tct agg gat gtc eta gaa aga gga gat gaa tct tta gaa gec 48 
Met Ser Phe Arg Asp Val lieu Glu Arg Gly Asp Glu Phe Leu Glu Ala 
15 XO 15 

30 tat ccc aga aga age ccc ctt tgg aga tct cct tea tac agt aca, tea 96 
Tyr Pro Arg Arg Ser Pro Leu Trp Arg Phe lieu Ser Tyr Ser Thr Ser 
20 25 30 

tta ctg acc ttc ggt gca tea aaa ctg ctt ctt tte aea tge tat aat 144 
35 lieu lieu Thr Phe Gly Val Ser Lys Leu Leu Leu Phe Thr Cys Tyr Asn 
35 40 45 

gtc aaa ttg aat ggt ttt gaa aaa tta gaa act gec ttg gaa cgt tec 192 
Val Lys Leu Asn Gly Phe Glu Lys Leu Glu Thr Ala Leu Glu Arg Ser 
40 50 55 60 

agg gaa aat aga ggc ett atg acg gtc atg aae cat atg agt atg 240 
Lys Arg Glu Asn Arg Gly Leu Met Thr Val Met Asn His Mfct Ser Met 
65 70 75 80 



45 



gtc gat gat ccg tta gtt tgg gca aca eta eca tat aag tta ttt acg 
Val Asp Asp Pre Leu Val Trp Ala Thr Leu Pro Tyr Lys Leu Phe Thr 
85 90 95 



268 



50 tct ttg gac aac ata aga tgg tet ttg ggt gca eat aat act tgc ttt 336 
Ser Leu Asp Asn He Arg Trp Ser Leu Gly Ala His Asn lie Cya Phe 
100 105 lio 



caa aac aaa ttt ctg gec aac ttt ttc tea ctt ggc caa gtc ctt tea 
55 Gin Asn Lys Phe Leu Ala Asn Phe phe Ser Leu Gly Gin Val Leu Ser 
115 120 125 

aca gaa aga ttt ggg gtg ggc cea ttt caa ggt tct ata gac get tea 
Thr Glu Arg Phe Gly Val Gly Pro Phe Gin Gly Sar Xle Asp Ala Ser 
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130 135 



140 



gat tea atg ttt aga caa att eta cca aga aae ttt ggc tec gaa ata 
Asp Ser Met Phe Arg Gin lie Leu Pro Arg Asn Phe Gly Ser Glu lie 
245 250 255 



52a 



ata aga ttg tta age cct gac gac act tta gac teg gaa tgg ace ect 480 
lie Arg Leu Leu Ser Pro Asp Asp Thr Leu Asp Leu Glu Trp Thr So 
D " S 150 1S5 ltJ0 

cac tet gag gtc tet tct teg eta aaa aaa gec tac tec ecg ccc ata 
His Ser Glu Val Ser ser Ser Leu Lys Lys Ila Tyr Ser S p" i£ 
10 165 "o 17S 

ata agg teg aag cca tet tgg gtc cat gtt tat cca gaa gga ttt trta 57* 
Xle Arg Ser Lye Pro Ser Trp Val His Val Tyr Pro Ilu Ily p£ 
XB0 185 ia 0 



624 



eta eaa tta tat ccg cct ttt gaa aat teg atg agg tat ttt aaa tgg 
Leu Gin Leu Tyr Pro Pro Phe Glu Aan Ser Met Arg Tyr Phe Lys Trp 
135 200 20S 

f* c f aa at H ate eta gaa gca aca aag ecg ccc att gca gta 672 
Gly lie Thr Arg Met He Leu Glu Ala Thr Lys Pro Pro Xle Val Val 
21° 215 220 



cca ata ttt get aca ggg ttt gaa aaa ata gea tee gaa gca gtc aea 720 
Pro lie Phe Ala Thr Gly Phe Glu Lys He Ala Ser Glu Ala Val Thr 
25 22S 230 235 240 



768 



aat gtt acc ata ggg gat cct tta aat gat gat tta ate gac agg tat 816 
Asn Val Thr He Gly Asp Pro Leu Asn Asp Asp Leu He Asp Arg Tyr 
260 265 270 

aga aaa gaa tgg aca cat ttg gtt gaa aaa tac tat gat ccc aaa aat 864 
Arg Lys Glu Trp Thr His Leu Val Glu Lys Tyr Tyr Asp Pro Lys Asa 
27S 280 285 

cet aac gac etc tet gac gaa ttg aaa tat got aaa. gag gcg eaa gat 912 
Pro Asn Asp Leu Ser Asp Glu Leu Lys Tyr Gly Lys Glu Ala Gin Asp 
290 235 300 



tta aga age aga tta gec get gaa ctg aga gec cat gtc get gaa att 960 
Leu Arg Ser Arg Leu Ala Ala Glu Leu Arg Ala His Val Ala Glu He 
4S 305 310 315 320 



aga aat gaa gtt cgc aaa tta cea ege gaa gae cct agg ttc aaa tee 1008 
Arg Asn Glu Val Arg Lys Leu Pro Arg Glu Asp Pro Arg Phe Lys Ser 
325 330 335 

eec tea tgg tgg aag egg tte aac ace acg gaa ggt aaa teg gac cca loss 
Pro Ser Trp Trp Lys Arg Phe Asn Thr Thr Glu Gly Lys Ser Asp Pro 
340 345 350 



55 gat gtt aaa gtc att ggc gaa aat tgg gca ata agg agg atg eaa aag 1104 
Asp Val Lys Val He Gly Glu Asn Trp Ala He Arg Arg Met Gin Lys 
355 360 365 

ttt ctg ect cca gag ggt aaa cca aag ggt aag gat gat tga 1146 
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Plie lieu Pro Pro Glu Gly Lys Pro Lys Gly Lys Asp Asp 
370 375 380 



5 <210> 2 

<211:> 381 
<212> PUT 

<213> SaccHaxomyc es cerevisiae 

10 <:400> 2 

Met Ser Phe Arg Asp Val Leu Glu Arg Gly Asp Glu Phe Leu Glu Ala 

1 S lo is 

Tyr Pro Arg Arg Ser Pro Leu Trp Arg Phe Leu Ser Tyr Sear Thr Ser 
20 25 30 

15 Leu Leu Thr Phe Gly Val Ser Lys Leu Leu Leu Phe Thr Cys Tyr Aan 
35 40 45 

Val Lys Leu Asn Gly Phe Glu Lys Leu Glu Thr Ala Leu Glu Arg Ser 

SO 55 60 

Lys Arg Glu Asn Arg Gly Leu Men Thr Val Met Asn His Met Ser Met 
20 65 70 75 80 

Val Asp Asp Pro Leu Val Trp Ala Thr Leu Pro Tyr Lys Leu Phe Thr 

85 90 95 

Ser Leu Asp Asn lie Arg Trp Ser Leu Gly Ala His Asn lie Cys Phe 
100 105 110 

25 Gin Asn Lys Phe Leu Ala Asn Phe Phe Ser Leu Gly Gin Val Leu Ser 
US 120 125 

Thr Glu Arg Phe Gly Val Gly Pro Phe Gin Gly Ser tie Asp Ala Ser 

130 135 140 

lie Arg Leu Leu Ser Pro Asp Asp Thr Leu Asp Leu Glu Trp Thr Pro 
30 145 150 155 160 

His Ser Glu Val Ser Ser Ser Leu Lys Lys Ala Tyr Ser Pro Pro He 

16S 170 175 

He Arg Ser Lys Pro Ser Trp Val His Val Tyr Pro Glu Gly Phe Val 
180 IBS 190 

35 Leu Gin Leu Tyr Pro Pro Phe Glu Asn Ser Met Arg Tyr Phe Lys Trp 
195 2G0 205 

Gly lie Thr Arg Met He Leu Glu Ala Thr Lys Pro Pro He Val Val 

210 215 220 

Pro lie Phe Ala Thr Gly Phe Glu Lys He Ala Ser Glu Ala Val Thr 
40 225 230 235 240 

Asp Ser Met Phe Arg Gin He Leu Pro Arg Asn Phe Gly Ser Glu He 

245 250 255 

Asn Val Thr He Gly Asp Pro Leu Asn Asp Asp Leu He Asp Arg Tyr 
260 265 270 

45 Arg Lys Glu Trp Thr His Leu Val Glu Lys Tyr Tyr Asp Pro Lys Asn 
275 280 285 

Pro Asn Asp Leu Ser Asp Glu Leu Lys Tyr Gly Lys Glu Ala Gin Asp 

290 295 300 

Leu Arg Ser Arg Leu Ala Ala Glu Leu Arg Ala His Val Ala Glu He 
50 305 " 310 315 320 

Arg Asn Glu Val Arg Lys Leu Pro Arg Glu Asp pro Arg Phe Lys Ser 

325 33 0 335 

Pro Ser Trp Trp Lys Arg Phe Asn Thr Thr Glu Gly Lys Ser Asp Pro 
340 345 350 

55 Asp Val Lys Val He Gly Glu Asn Trp Ala He Arg Arg Met Gin Lys 
355 360 365 

Phe Leu Pro Pro Glu Gly Lys Pro Lys Gly Lya Asp Asp 
370 375 380 
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<210> 3 
<211> 1374 
5 <212:> DMA 

<213> Arabidopsis thaliana 

<220* 
«£221> CDS 
10 <222> (5) . . (1348) 

c400> 3 

caga atg gga att cat: ttt gtt gae aag gca gat eta tgg aag agt gca 49 

Men Gly He His Phe VaL Asp Lys Ala Asp Leu Trp Lys Sex- Ala 

15 1 5 10 15 

etg ttg ttc aat ctt aaa ett cgt gat cga ttt ega ate gec gtc gat 97 
Leu Leu Phe Asn Leu Lys Leu Arg Asp Arg Phe Arg He Ala Val Asp 
20 25 3D 

gat cae cgt ggt ega get acg gtc ttc tea eeg gat ggt tgc etc tct 145 
Asp Hia Arg Gly Arg Ala Thr Val Phe Ser Pro Asp Gly Cys Phe Ser 
35 40 45 



20 



25 tec act ate cae cge tgg gtg act ega ttc egg aae ttt cgc egg gag 193 
Sex Thr He His Arg Trp Val Thr Arg Phe Arg Asn Phe Arg Arg Glu 
50 55 60 

tct etc ect tct eca ceg get ttt tat cge aga cga gtt tct aag gae 241 
30 Ser Leu Pro Ser Pro Pro Ala Phe Tyr Arg Arg Arg Val Ser Lys Asp 
65 70 7S 

tea acg gea gaa gaa gag tet get ctt ttc egg atg eta caa act gtg 289 
Leu Thr Ala Glu Glu Glu Sear Ala Leu. Phe Arg Met Leu Gin Thr Val 
35 80 85 90 95 

get gtt cee ctt att gga aat get tgt cat gtt ttc atg aat ggt ttt 337 
Ala Val Pro Leu He Gly Asn Ala Cys His Val Phe Met Asn Gly Phe 
loo 105 lio 



40 



aae cgt gtt eag gta tat ggt tta gag aaa ttg cat gat get tta etc 385 
Asn Arg Val Gin Val Tyr Gly Leu Glu Lys Leu His Asp Ala Leu Leu 
115 1*0 125 

45 aae aga cea aag aae aag ect ctt gta acg gtg age aat cat gtt gca 433 
Asn Arg Pro Lys Asn Lys Pro Leu val Thr Val Ser Asn His Val Ala 
130 135 140 

tct gtg gat gat cca ttt gtc att get tea tta ett cea ect aaa ttt 461 
50 Ser Val Asp Asp Pro Phe Val He Ala Ser Leu Leu Pro Pro Lys Phe 
145 ISO 155 

eta ctt gat gee cgt aat ttg agg tgg acg etc tgt get aca gat aga 
Leu Leu Asp Ala Arg Asn Leu Arg Trp Thr Leu Cys Ala Thr Asp Arg 
55 160 1*5 170 175 



529 



tgc ttt aaa aae ect gta act tea get ttc tct cga tet gtc aaa gtt 577 
Cys Phe Lye Asn Pro Val Thr Ser Ala Phe Ser Arg Ser Val Lyj Val 
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ctg cca att tct cgt ggt gaa gga act tat cag cag gga atg gac att 625 
Leu Pro lie Ser Arg Gly Glu Gly lie Tyr Gin Gin Gly Met Asp He 
135 200 205 

gcg att ccg aaa ttg aac aac gga ggg tgg gtt cac ate tct cca gaa 673 
Ala He Ser Lys Leu Asn Asn Gly Gly Trp Val His He Phe Pro Glu 
210 215 220 

10 ggc agt cgc tec egg gat ggt gga aag act atg ggc tea gcg aag agg 721 
Gly ser Arg Ser Arg Asp Gly Gly Lys Thr Met Gly ser Ala Iiys Arg 
225 230 235 

ggt ate gga agg ttg act ttg gac gca gat act etc cct atg gtt gtt 769 
15 Gly He Gly Arg Leu He Leu Asp Ala Asp Thr lieu Pro Met Val Val 
240 245 250 255 

cct ctt gtg cat act ggt atg cag gat ata atg cca gtt gga get agt 817 
Pro Phe Val His Thr Gly Met Gin Asp He Met Pro Val Gly Ala Ser 
20 260 265 270 

gtt cca egg att ggc aag aca gtg aea gtg ate att gga gac cct att S6S 
Val Pro Arg He Gly Lys Thr Val Thr Val He He Gly Asp Pro He 
275 260 285 

25 

cat ttt aat gac att etc age act gaa gga gec caa eac gee tea agg 913 
Hiss Phe Asn Asp He Leu Ser Thr Glu Gly Ala Gin His Val Ser Arg 
290 295 300 

30 aaa cac ctg tat gac gee gtt teg tec aga att gga caa aga ctg tac 961 
Lys His Leu Tyr Asp Ala Val Ser Ser Arg He Gly Gin Arg lieu Tyr 
305 310 315 

gat tea aaa gca caa gtt gat aga gta tac ata gaa caa caa tec acg 1009 
35 Asp Leu Lys Ala Gin Val Asp Arg Val Tyr He Glu Gin Gin Ser Met 
320 325 330 335 

atg tea cac aat gec aaa aca ccc teg gac cgt get gec gag ate ttt 10S7 
Met Ser His Asn Ala Lys Thr Pro Ser Asp Arg Ala Ala Glu He Phe 
40 340 345 350 

cat aga gtc gat tgg gac tea ttt ggg acg gga gca caa ttt tea gaa H05 
His Arg Val Asp Trp Asp Ser Phe Gly Met Gly Ala Gin Phe Ser Glu 
355 360 36S 

45 

gaa tea tea ccc agt agt aaa ccg att ggc caa age gat gat cgc att 1153 
Glu Ser Ser Pro Ser Ser Lys Pro He Gly Gin Ser Asp Asp Arg He 
370 375 3B0 

50 gtc aga tct ccc aaa agg aga gec tea ccc gaa gga ggg gtc age ttg 1201 
Val Arg Ser Pro Lys Arg Arg Val Ser Pro Glu Gly Gly Val Ser Leu 
385 390 39S 



aag att aag aag etc atg gac tea acc gag atg atg ggg ttt gcg get 
55 Lys He Lys Lys Leu Met Asp Ser Thr Glu Met Met Gly Pne Ala Ala 
400 405 410 «5 

aga ggg tea eta atg aac gag tac aag agt egg get gaa tct gee aat 
Arg Gly Leu Leu Met Asn Glu Tyr Lys Ser Arg Val Glu Ser Ala Asn 
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420 425 430 

gtt ggt agg cet tba aag get tgg aga gag tat ttt atg aae cgt gga 1345 
VaX Gly Arg Pro Leu Lys Ala Trp Arg Glu Tyr Phe Met Asn Arg Gly 
435 440 445 



tta taacttgeaa gttaaactcc cgttag 
Leu 



<2io> 4 
<211> 440 
<2X2> PM 

c2l3> Arabidopsis thaliana 



<400> 4 

Met Gly lie His Phe Val Asp Lys Ala Asp Leu Trp Lys Ser Ala Leu 
1 5 10 X5 

20 Leu Plie Asn Leu Lys Leu Arg Asp Arg Phe Arg Xle Ala val Asp Asp 
20 25 30 



His Arg Gly Arg Ala Thr Val Phe Ser Pro Asp Gly Cys Phe Ser Ser 
35 40 45 

Thr Xle His Arg Trp Val Thr Arg Phe Arg Asn Phe Arg Arg Glu Ser 
50 55 60 



Leu Pro Ser Pro Pro Ala Phe Tyr Arg Arg Arg Val Ser Lys Asp Leu 
30 65 70 75 B0 

Thr Ala Glu Glu Glu Ser Ala Leu Phe Arg Met Leu Gin Thr Val Ala 
85 90 95 

35 Val Pro Leu lie Gly Asn Ala Cys His Val Phe Met Asn Gly Phe Asn 
100 105 xxo 



Arg Val Gin Val Tyr Gly Leu Glu Lya Leu His Asp Ala Leu Leu Asn 
115 X20 X25 

Arg Pro Lys Asn Lys Pro Leu Val Thr Val Ser Asn His Val Ala Ser 
130 135 140 



Val Asp Asp Pro Phe Val Xle Ala Ser Leu Leu Pro Pro Lya Phe Leu 
45 X45 150 155 160 

Leu Asp Ala Arg Asn Leu Arg Trp Thr Leu Cys Ala Thr Asp Arg Cys 
1S5 170 175 

50 Phe Lys Asn Pro Val Thr Ser Ala Phe Ser Arg Ser Val Lys Val Leu 
X80 185 ISO 

Pro lie Ser Arg Gly Glu Gly Xle Tyr Gin Gin Gly Met Asp lie Ala 
X95 2Q0 205 

Xle ser Lys Leu Asn Asn Gly Gly Trp Val His Xle Phe Pro Glu Gly 
2X0 215 220 

Ser Arg Ser Arg Asp Gly Gly Lys Thr Met Gly Ser Ala Lys Arg Gly 
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225 «° «5 240 

He Gly Arg Leu He Leu Asp Ala Asp Thr Leu Pro Met Vel Val Pro 

5 245 2 5D 255 

Phe val His Thr Gly Met Gin Asp Xle Met Pro Val Gly Ala Ser Val 
250 265 270 

in Ile Gly Lys Thr Val Val Ile Ile G1 y ^ Pro He His 

1U 275 280 28S 

Phe Asn Asp Xle Leu Ser Thr Glu Gly Ala Gin Hie Val Ser Arg Lys 
290 235 300 

15 His Leu Tyr Asp Ala Val Ser Ser Arg He Gly Glu Arg Leu Tvr Asr, 
305 3 " 315 12% 



20 



3S 



Leu Lys Ala Gin Val Asp Arg Val Tyr He Glu Gin Gin Ser Met Met 
325 3310 335 

Ser His Asn Ala Lys Thr Pro Ser Asp Arg Ala Ala Glu He Phe His 

345 3so 



Arg Val Asp Trp Asp Ser Phe Gly Met Gly Ala Gin Phe Ser Glu Glu 
25 35S 360 S65 

Ser Ser Pro Ser Ser Lys Pro Xle Gly Gin Ser Asp Asp Arg lie Val 
370 375 380 

30 Arg Ser Pro Lys Arg Arg Val Ser Pro Glu Gly Gly Val Ser Leu Lys 
3fiS 350 335 400 



He Lys Lys Leu Met Asp Ser Thr Glu Met Met Gly Phe Ala Ala Arg 
40S 410 41S 

Gly Leu Leu Met Asn Glu Tyr Lys Ser Arg Val Glu Ser Ala Asn Val 

420 42S 430 



Gly Arg Pro Leu Lys Ala Trp Arg Glu Tyr Phe Met Asn Arg Gly Leu 
40 435 440 445 



<210> 5 
45 <211> 961 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
50 <221> CDS 

<222> (38). -(889) 

<400> 5 

ctggtctcgt tcctaatcgg agagaacttt gaagcta atg ggg aag ata atg gaa 55 
55 Met Gly Lys He Met Glu 

1 5 

tgg gca gca aga tct gat cat ttg gga gga att cca agg aat act gtg 103 
Trp Ala Ala Arg Ser Asp His Leu Gly Gly Ile Pro Arg Asn Thr Val 
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10 15 20 

ata atg get gtt agt gca ttc gca aaa gca gta gca aat cut: toe aat 
lie Met Ala VaX Ser Ala Phe Ala Lye Ala Val Ala Asn Leu Cys Am 
25 30 3s 

aaa age tea gtt cac aat gca gat act ett atg aat ett gtc caa tea 
Lys Ser Ser Val His Asn Ala Asp Thr Lea Met Asa Leu Val Gin Ser 
4 & 45 50 

aga cca cct ggt gtt ect etc ate act gtt agt aat cac atg teg act 
Arg Pro Pro Gly Val Pro Leu lie Thr Val Ser Asn His Met Ser Thr 
55 6S vo 

ttg gat gat cca gca atg tgg ggg gca ttt aag ggt etc ctt tec tta 
Leu Asp Asp Pro Val Met Trp Gly Ala Phe Lys Sly Leu Leu Ser Leu 
75 ao as 

gat cca gaa ttg get egg egg gtt ctt get gca gag gat ata tgt ttc 
Asp Pro Glu Leu Ala Arg Trp Val Leu Ala Ala Glu Asp lie Cys Phe 
90 *5 100 



151 



199 



247 



295 



343 



agg aac cct ata ttc tec tac att ttc cgc aet gga aaa tgt ata cct 391 
Arg Asn Pro lie Phe Ser Tyr lie Phe Arg Thr Gly Lys Cys He Pro 
25 10S no 115 

ata act aga ggt ggt gga ate cac caa gaa aac atg aat gaa get etc 439 
He Thr Arg Gly Gly Gly He Tyr Gin Glu Asn Met Asn Glu Ala Leu 
120 125 130 



cag cga tta aaa gat gga tct tgg etg cat acc ttc cca gag gga aag 467 
Gin Arg Leu Lys Asp Gly Ser Trp Leu His Thr phe Pro Glu Gly Lys 
135 140 145 150 

gtg ttt caa gat gat gtt cct ata aga cga ett aaa tgg gga aet gca 535 
Val Phe Gin Asp Asp Val Pro lie Arg Arg Leu Lys Trp Gly Thr Ala 
1S5 lfio X65 

age etc ate gec cgt tee cca gtt acc cca ate gtt ttg cca ata att 583 
Ser Leu He Ala Arg Ser Pro Val Thr Pro He Val Leu Pro He He 
170 175 1B0 

cac cgt ggt ttc gag gag atg atg ecg gag aac tac aat aat gga cga 631 
His Arg Gly Phe Glu Glu Met Met Pro Glu Asn Tyr Asn Asn Gly Arg 
45 IBS 190 135 



aga cca ctg gta ccg ttg ecg aac aaa cac ctt aaa gtt gtg gtt ggt 679 
Arg Pro Leu Val Pro Leu Pro Asn Lys His Leu Lys Val Val Val Gly 
200 205 210 

gaa cca att gag ttt gat gtt cca atg atg gtt gag act get gtc ttg 727 
Glu Pro He Glu Phe Asp Val Pro Met Met Val Glu Thr Ala Val Leu 
21S 220 225 230 

55 gae tec cgc cat gta acc cct cct ctt caa gaa gtg aaa tgg cct gtc 775 
Asp Ser Arg His Val Thr Pro Pro Leu Gin Glu Val Lys Trp Pro Val 
23S 240 ^ 245 

etc aet tct get ggc caa gtg eta gac gaa act get cag aga cac etc 823 
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lieu Thr ser Ala Gly Gin Val Leu Asp Glu Thr Ala Gin Arg His Leu 
250 255 260 

tac ata get exg tec gag aag att caa tec tec teg gaa aca ttg aga 871 
5 Tyr lie Ala Leu Ser Glu Lys He Gin Ser Ser Leu Glu Thr Leu Arg 
265 270 275 

etc tta gee aag egg ttg tgacttccgc egggaaactt eatatcaaaa 919 
Leu Leu Ala Lys Arg Leu 
10 280 

tccggttcac aacttttaaa gttecggttt agattcgeea tg 961 

15 <210> 6 

<2115> 2B4 
<212> PRT 

<213* Arabidopsis thaliana 
20 <400> C 

Met Gly Lys He Met Glu Trp Ala Ala Arg Ser Asp Hie Leu Gly Gly 
15 10 IS 

He Pro Arg Asn Thr Val He Met Ala Val Ser Ala Pke Ala Lys Ala 
25 20 25 30 

Val Ala Asn Leu Cya Asn Lys Ser Ser Val His Asn Ala Asp Thr Leu 
35 " 40 45 

30 Met Asn Leu Val Gin Ser Arg Pro Pro Gly Val Pro Leu He Thr Val 
50 55 60 



35 



50 



ser Asn His Met Ser Thr Leu Asp Aap Pro Val Met Trp Gly Ala Phe 
65 70 75 80 

Lys Gly Leu Leu Ser Leu Asp Pro Glu Leu Ala Arg Trp Val Leu Ala 
BS 90 95 



Ala Glu Asp He Cya Phe Arg Asn Pro He Phe Ser Tyr He Phe Arg 
40 loo 105 HO 

Thr Gly Lys cys He Pro He Thr Arg Gly Gly Gly He Tyr Gin Glu 
115 120 125 

45 Asn Met Asn Glu Ala Leu Gin Arg Leu Lys Asp Gly Ser Trp Leu His 
130 135 140 

Thr Phe Pro Glu Gly Lys Val Phe Gin Asp Asp Val Pro He Arg Arg 
145 150 1SS 160 

Leu Lys Trp Gly Thr Ala Ser Leu He Ala Arg Ser Pro Val Thr Pro 
165 170 175 

He Val Leu Pro He He His Arg Gly Phe Glu Glu Met Met Pro Glu 
55 xao 185 

Asn Tyr Asn Asn Gly Arg Arg Pro Leu Val Pro Leu Pro Asn Lys His 
195 200 20=) 
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Leu Lye Val Val Val Gly Glu Pro lie Glu Phe Asp Val Pro Met Met 
210 215 220 

Val Glu Thr Ala Val Leu Asp Ser Arg His Val Thr Pro Pro Leu Gin 
S 225 230 " 23S 24,0 

GlU Val Lys Trp Pro Val Leu Thr Ser Ala Gly Gin Val Leu Asp Glu 
245 250 2S5 

10 Thr Ala Gin Arg His Leu Tyr He Ala Leu Ser Glu Lys lie Gin Ser 
260 265 270 



15 



55 



Ser Leu Glu Thr Leu Arg Leu Leu Ala Lys Arg Leu 
275 280 



<210> 7 
<211> 403 
20 <212> X3UA 

<213> Zea mays 

*220> 

<221> mis cofeature 
25 <222> (1) (403) 

<400> 7 

ecgctcexea ccgtcageaa tcacatgtcc aegttagatg aeecgcctat gttggcgafct so 
csaagggtttt ccaacfctcgg acgcaaagct tggaaggcgg gtgctgacag cagaagafcat 120 
30 acgetteaag aatgtagtea tgtectacat gtttegactt gggaaatgca taceaatcac 180 
gagagggggt gggatctatc aagaaeaeat gaatgaagec cttgacgtgc ttagaaatgg 240 
aggctggctg catacattcc ctgaaggaaa aatagcccaa gaagateage cgattagaag 3 00 
afctgaagfcgg ggaactgcca gtettatcgt: ccgagcacct ataactccaa fcagttttgec 360 
aattgtfceac tctggtttcg aaaaggtcat gccagaaaac ccg 403 

35 

e210> 8 
<211> 423 
«s212> DMA 
40 <213s> Glycine maac 

<220> 

<221> miac^feature 
<222> (1) .7(423) 

45 

*400> 8 

ttcccgggtc gaecaccatc ggcgccgccg aeaceaccac acctttatcc cctctgccga 60 
tggctacttc tcctccacca ttcaacgcfcg ccccagcega ttccgtgatc tccgaaggga 120 
ttcattgect textccacet cttbecateg taaacgagtg attaaggafct tcagttcfcga 180 
50 ggaagactca gctcttgfcte gtacgatgca agetgttgeg gttectgttc ttggaaatgt 240 
ctgteaegtg tttatgaaeg gattaaacca ggcgeaggcg tatggtttag aaaaactgca 300 
ccccgcgttg ctgeatagae ctaagggcaa aecccttctt acggceagca atcafcgtfcgc 36 o 
ctecauggat gatccgcfctg ttategetfce getgcttect tegagtgttc tcttggatge 420 
tan 423 



<210i> B 
<211> 40B 
<;212* MA 
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<2X3> Glycine max 
<220> 

<^221> misc^feature 
<=222> CI).. (408) 

<:220> 

<221> misc_f eature 

^222* (i).7f4oa) 



<400> 9 

tttlt^t f= aac * caac atgtcacgga egatggagtg ggcggcgagg gcggagcacc SO 

tcccagaaaa cfcC g tgatt g cggcggtggg eggattcgce aaaacigtgt 120 
cgtctttcct caar=accg« gatgtxcaca aeggcgacae tctectecge ctegtccget ISO 
15 ccagacccca ccgcgtccce ctcattaccg ttagcaatca catgtc<=a« ttggatgate III 
SSSZ SESS 5553?=? — ttagct cgatgggctc 30 o 



fccgctgccga agatatttgc fctcaggaatg ccctctattc ctatattttt cggctjcata 360 
ettttccaga aggaaaagtg tateaagaag atgcacctat aaggcaat ~ 40 & 



*210:> 10 
<211> 368 
<212=> BHA 

<213> Iiinum usltatissimum 
<220> 

<221> raise feature 
<222> (1) .7(368) 

30 <400> 10 

cggttagcaa ccatgttget cctgtngatg acceatctgt gatcgcctca tfcgctactsac 60 

eaagagtact tttggatgct cagaactfcga ggtggaeact ttgegcaagg ategetgctt 120 

taggaatccc gtcaettccg eattcttaag actgtcaaag tettgecect ccctcgtggt 180 

catggagttt at<=agaaggg tatggacacg gcaattgcga acgtgaacac tggtggetgg 240 

gtteacatct tcccggaggt agccgttcca aggatggtgg gaaaactafcg gggtetatta 300 

aaagaggtgc tggaaggttg gtacttgatg ctgatacttc tccccattgt agtceeggtc 360 

gtgeacac " 36fl 



40 <210> li 

<211> 376 

<212> DMA 

<213> Liuum uaitatisaintum 

45 <22D> 

<221> mis e_fe attire 
<222* (1) . . (376) 



<400> 11 

tcgtagttag agaaectcag atggctggga tcaeaaggaa tgcagtgttt gtgaccgteg go 
gtgcetfctgc taaggcagtg agtagtcfctc tgaacaatac afccagtceac aatgcagaea 120 
etetaettcg ectagttcga tetcggcscgc ctggtgfcaec tcteatcact gtcageaacc 180 
acatgtcaac gttagatgat cctetgacgt ggggatfccaa gggattccea accatgggat 240 
gegaaattgt ttcgatgggt atgggctgct gaagacatct gttfccaggaa ttcttttcat 3 00 
55 cottacctcc ttegcacggg gaaatgtatt cecattacaa gaggtggggg aatttaccgg 360 
agccacatga atgaag ~" 376 

<210> 12 
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12 

<211> 418 

<213> Liixum usitatisaimum 

«=220> 

<221> xnise_feature 
<222> (1)..(41B) 

<400> 12 

ccgccfcggtg tacetcteafc cactgttagc aateaeatgt caacgttaga fcgaccctetg 60 

atgtggggat fceaagggatt cceaateatg aatgcgaaat: tgtttcgatg ggcafctaget 120 

gctgaagaca tetgtttcag gaattctcct cattectaet, tctttcgcat ggggaaatgt 180 

atecccatta caagaggtgg tggaatttat caaagccaea tgaatgaagc fccttcagcgc 240 

tcgagcaatg gtgattggefc geacacattc cctgagggaa aggccaacca agaaaetgga 300 

15 cctacaagac gattgaaatg gggaactgec agtctcateg tecgtgcccc tgttacaecg 360 

atagtattac ccattgttca rcgtggcttt caagaggtga tgccagagaa ctacctat 418 

<210* 13 
20 <21l> 445 
<212> DMA. 
c213> Glycine max 

<220> 

25 <221> miscjEeature 
«=222> (1) -7(445) 

<400> 13 

ttccgggtcg accgccattg gegecgecgc eaetaccacc acacctttat cceetcegcie 60 
30 gatggctact tctcctceacs cattcaaegc tggcteagce gatttcgtga tttccgeaga 120 
gaefcegfctgc cgtegtceac etccttttat cgeaaacgag tgattaagga fcttcagttct 180 
gaagaagatc eaactcttgt tcgtatgatg eaagccgttg eggttcetgfc tcttggaaat 240 
gtctgteacg tgttcatgaa cggattaaae agtgtgcagg tatatggctt agaaaaactg 200 
cactcegctt: tactgcaaag acctaaagga aaacctcttc tcacggtcag caateatgct 360 
35 gctfcccatgg atgatcefcct tgttafctgct tcgctgctcc ctccgagtgt tctttcggac 420 
gctaggaatc tcagatggae gcctc 445 

«210*> 14 

40 <211> 361 

<212> DMA 

<213> Hordeum vulgar a 
<220* 

45 <22l> mis cofeature 

<222> (1) -7(361) 

<400> 14 

ggaaccggtg atcctccgea fcggttcaagc tgtggcggtt cctctattgg gaaatfacgxg 60 
50 ctacgtgttc atgaatggcc tcaaccgcgt tcaggttcat ggcctggaga agcrgcacaa 120 
ggcattgctt gagaggceta aggacaagce cctagcaacg gttagcaacc atgttgetttr 180 
tgtcgatgac ccatttgtga ttgcttcatt gcfcaccacca agagtaettt tggatgctca 240 
gaacttgagg tggacacttt gcgcaacgat cgctgtttta ggaatcccgt cacttct gca 30O 
ttctttaaga ctgtcaaagt ettgeceete tcfccgtggte acggagttta tcagaagggfc 360 
55 a 361 



<210> 15 
<211> 472 



22^81' S 8SS2S09 T29 6tH- 
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<212> DNA 

<213> Sraasica napue 
<220* 

<22l> mi b c_f eat ure 
<222> (1) - . (472) 

<400b> 15 

cgfcatcggaa tttccgggte gaegaccacc gceggagagc cgcggtttta fcatacggacg So 
gttacttctc tteetccatc caecgcttgg ctgetcgatt gcggaactfcc cgcegcgagt: 120 
ctctcectte tgeccccgct ttttatcgea gaagagtacc fcaaagatfctg aeggcagaag lao 
aagagtctgc fcatcttccgg atgetteaag ctgtggctgt tccacttatt ggaaacgctt 240 
gteatgtctt catgaatggt entaaccgtg ttcaggtgta tggtctagag aagttgeatg 300 
atgctctgct caacaggcca aagaacaagc ctetegtaac ggttagcaat catgtggeat 3 60 
ccttggatga teeatttgtc attgcttcgt taettecgce taagcttcta etcgatgctc 420 
gtaatttgag gcggacgctt tgtgctacag atagatgctt fcaagaaccet gt 472 



<210> X6 
20 <211=> 412 
<212> DMA 

<213> Brasaica napus 

<:220:> 

25 <22i> mxsc_feature 

«222> (1)..{412> 
<400> IS 

tttagatgat ccagtaatgt ggggagggfct caaggcgtcfc tctttcccta gatecagagc 60 
30 cggctcgatg ggttcttget gcagaggaca tttgcttcaa gaaccctgtc fctctcctaca 120 
tcttecgeac tggeaagtgt ataectataa ctagaggcgg tggaatctac caagaaeaca 1B0 
tgagtgaagc tctcgagega ttaaaagatg gatcttggtt gcataecttc ecagagggca 240 
aggtgtttca agaagatgtg cctataagac gactcaaatg gggaaccgca agcctcatcg 3 00 
cccgttgece agtcacecca atcgtettgc caataattca ccgtggcttg acgagatgaa 360 
35 rgcsegagagt acatttatgg aaaazigacca cegtaecogt t ggg aacaaa aa 412 

<210> 17 
<211> 410 
40 <212> DMA 

<213> Brass ica napus 

*220> 

«i2ai> m±sc_f eature 
45 <222> (1)..(410) 

<40Q> 17 

fcttagatgat cseagcaatigt ggggagggtt caagggtctt cttteettag atccagagct 60 
ggctcgatgg gctcntgctg cagaggacat ttgtttcaag aaccctgtct tetcctacat 12 0 
50 cttccgcact ggcaagtgta cacetacaad tagaggtggt ggaatctace aagaacacat leo 
gagtgaaget ctcgagcgat taaaagatgg atcttggttg cataecttee cagagggcaa 240 
ggtgtttcaa gaagatgtgc etataagacg acttaaatgg ggaacegaaa gecfccategc 3 00 
cegttgceca gtcaeceaaa tcgtcttgce aatatttcac cgtggfctttg acaacatgat 360 
gcccgaaaat gtcctfcttat ggaagaatga eaaecgtaoc tgtggggaan 410 



<210> is 

<211> 420 
<212> BNfc 
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<2X3> Glycine max 
<220> 

«=221> mi s c_fe atuare 
5 <:2225> (3.).. (420) 

«£400> 16 

gecattggcg ccgccgacac aaecaccaca cctttatcce etcegccgat ggcfcacctefc 60 
cctccaecat tcaacgctgg eteagccgat tccgtgatte cegcagagac ccgttgccgt 12Q 
10 egtccacetc tttttatcgc atacgagtga ttaaggattt cagttctgaa gaagattcaa ISO 
ctcttgttcg tatgatgcaa getgfctgcgg ttcctgttct tggaaafcgtc tgtcacgtgt: 240 
tcatgaacgg attaaacagr gtgcaggtat atggtctaaa aaaactgcac tccgctttac 300 
tgcaaagacc caaaggaaaa ectcfctctfca cggtcagcaa tcatgttgct tccatggatg 360 
atcetcttgt tattgcttcg etgettcecc egagtgttct tttggacgct aggaatetca 420 

15 

<i210> 19 
<211> 490 
<ai2> DMA 
20 <i2X3> Brass ica napus 

«:220» 

<221> mis cofeature 

«£222> (l).-(490) 

25 

<400> 19 

aattcctggg tcgacgattt cgtcccgaga tggtggcaag actatggget eagcaaaaag 60 
aggtattgga aggtfcgattc tggacgcaga taccetecct atggttgtcc eatttgtgca 120 
t^tctggtatg caagafcataa cgcctaeagg agccagtgtt eeacggatxg gcaaaacagt 180 

30 gaeagfcgatc attggagatic ctattccctt taafcgaeett gtagacactg aaggagccaa 240 
aeacgtttca aggaagcagt tgtatgacgc tgfcatettcc aggataggac aaagactaca 300 
ccagrtaaag caacaggttg afcaaagtafcc tctgggagca caa tat t flag aagaatcacc 360 
agcccttctt ggcaaacaaa tttcecaaac cgacgtncgt cteaatggct tggactggca 420 
tgtteretaaa agggattgce atcegaagga ageateagce tgaaggttaa gaggtttatg 480 

35 gactefcacag 490 

<:210> 20 
<211> 386 
40 <212> DNA 

<£213> Zea mays 

<22Q> 

<22l> itiise_jfeature 
45 <222> (1) . - (386) 

<400> 20 

cgtgettaga aatggaggct ggctgcatac attccctgaa ggaaaaafcag cccaagaaga 60 
tcagccgatt agaagattga agtggggaac ggccagtcfct attgtecgag cacctafcaac 120 
50 cccaatagtt ttgccaattg fctcactetgg tttcgaaaag gtcatgccag aaaactcgtc lao 
cttfcggacgg cgaccaccgg tgceactctg cagtaagaag atagacatea ttgttggaga 240 
gceaatagag tttgacttgc caagcttgaa gcaagaagca teaaeggtae cecatgactc 300 
atcetetgaa cggaaggggt ggccggccat taeaccagac gggctggacg aggcegccea 360 
gagatggcnt taecagaaga tgteag 3*5 



55 



<210> 21 
<211> 429 
<212> DNA 
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<2X2> Brassica napus 
<220> 

<22X> mis cofeature 
*222> (1) . . (429) 

<400i> 2X 



10 SSSS^^sss. 
seas sssa S Si SIS ™~ "° 

wmut nubn ^c^J SSS SagSS S££«£ iJJ 



<210> 22 
*2XX> 436 
20 <212> UNA 

<213> Brassica napua 

<220> 

<221> miec__feature 
25 «=222> (X)..(436) 



<400> 22 

tetagatgac ecactatcgt ggggagggct 
gffctcgatgg gatettgctg cagaggacat 

30 tttcegcacc gacacgcgta tacctataac 
gagtgaagct ctagagcgat taatagatgg 
gtgectacaa gacgactcaa atggggaacc 
ceaaccgfcct tgceaataat tcaccgtggt 
tatggaagaa taccaccgtt accgctgtgg 

35 ccaatcagag fcfcgatg 



ccagggtctc atfctccttag atccaaagct 60 
ttgtctcaat aaccctgtct tetcctaeat 120 
tagaggtggt ggaatctace aagaacacat lao 
atcttgcacg gcaaggcgfcc teaagaagat 240 
gcaagcctca tcagcegttg cccagtcacc 3 00 
tctgacgaga fcgatgccgga gaagtacatc 3 60 
aacaaaaacc ttaaagfcagt tgfctggtgaa 420 

436 



<2X0* 23 
<2XX> 423 
40 <212> DNA 

<2X3> Brassica napus 

<220> 

<22l> misc_feature 
45 <222> (l).-(423) 



<400> 23 

ggatgaccca tttgtcattg cttcgctact 
tttgaggtgg acgccttgtg ctacagatag 

50 ctttcgatcc gtcaaggttt tgccagttfce 
ggacattgcg atctcgaaat cgaataatgg 
tcgctcccga gatggtggca agactatggg 
tttggacgca gataccctcc ctaatgttgt 
aatgcetata ggagceagtg ttccacggat 

55 ten 



tccgcctaag ctcctaetcg atgctcgtaa 60 
argcttcaaa aacectgtaa ettcagcttt X20 
tcgcggfcgaa ggaatttatc agcagggaat lao 
aggatgggtfc cacatatttc cagaaggrag 240 
ctcagcaaaa agaggtatcg gaaggtcgat 300 
tccatttgtg cataotggta tgcaagatat, 360 
fcggeaaaaca gtgaeagtga teactggaga 420 

423 



<2X0> 24 
<21X> 400 
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<212> DNA 

<c213> Oryza sati>ra 



<220> 

<221> mise_f eature 

<222> (1) .7(400) 



<400> 24 

gcgatggcat ctaccaaaag ggaatggaca 
gggtfccatat fcttcecagaa ggaagtcgtt 
ceaagagagg fcgttggaaga ttggtaatgg 
ttgtccratae aggaatgcag gatataatge 
aaagggtgatr tgcggttgtt ggtgateeaa 
gcgatgaaac ccaacacatc tctagaggca 
ggcagagact geaggaactg aaagccgaag 



tggcactttc aaagttgaac aatggtggat go 
caaaggatgg agggaaaaec gtrcgctcctg 120 
acgtrcgaeag ecttccagfct gtaataeect 1B0 
ctgtcggaaa aegtattcca agagcaggca 240 
fccaactteaa egaccttatc atxgacaaca 3o0 
ttttgtatga caaagcaaca gaaaggattg 360 
tcgatagact 40 0 



<21Q> 2S 

<211> 4X4 

<212> DMA 

«=213> Bxassiea napus 
<220> 

«±22l> misc_jfiaature 

*222> (1> . . (414) 



<400> 2S 

ggcageaaga tetgatcact tgggaggaat 
tgctttcgeg agagcagtag ctaatctttg 
cctcatgact cttgteegtt cacgaccacc 
cagcaatgtg gggagggtte aagggtefcte 
tgcttgctgc tgaggatata tgtttcaaga 
gcaagtgtat acctataact agaggtggng 
ttgaacgatt aaaagatgga tcttggttgc 



eccaagaaaa accgtgataa cagccgttgg 60 
caacaaaacc aaagttcaca atgcagatac 120 
tggtgtcecfc ctcatcactt ttagatgatc 180 
tttctttaga tceagagtcg gcfccgafcggg 240 
actccttctt cteetaeatc ttccgcactg 300 
gaatctatca agaacacatg agtgaagctc 360 
ataccttccc agaggggeag gtgg 414 



<210> 26 
<2X1> 397 
<212> DNA 

<213s> Braasica napus 
<220> 

<i221> misc_f eacure 
<222> (1) - . (397) 



<400> 26 

etgcccccgo tttttatcgc agaagagtac 
ecatcttccg gatgcttcaa getgtggetg 
teatgaatgg tcttaacegt gtteaggtgc 
teaacagacc aaagaaeaag cctcttgtaa 
atecatttgt cafctgcttcg ttaettecfce 
ggtggaegct ttgtgct^ca gatagatgct 
gatccgtcaa agttttgcca gfcttcfccgeg 



ccaaagattt gaeggcagaa gaagagtctg 60 
ttccaettat tggaaacget tgteacgttt 120 
acggtttgga gaagttgcafc gatgctctac 180 
cggttageaa tcatgtggeg ccctcggafcg 240 
ctaagcttct acttgatgct cgtaatctga 300 
ttaagaacce tgtaacttca gctttcttttf 360 
gtgaagg 3^7 



<210> 27 
<211> 42$ 
<212> DHA> 

<i213> Brassica napua 
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<220> 

<22l> mi s c_f ea ture 

<222> (X) . . (429) 



<400> 27 

!f gaCSjat " e 5t^tccg tcaaggttte gccagtttct cgeggtgaag 60 
gaattcatea geaggcjaatg- gacattgcga tttcgaaatc gaataacaga gLiLatcc i,« 
«a2^^ cgcfccccgag atggcggcaa gattatXR fSgSaaaa III 

""""P" ttggacgeag ataecctccc tatggtlglt ceafctg^S 2" 
gcaacatata atgcctatag gagccactgt tccaeggact gacaaaaSg 30S 
tgacagtgat catcggagat cctattecet tcagtgaect tgtagaeact gaa^oateef tin 
ZSSSF aa99aaccaff ctctatcgt* caggaSJac ££££ 2! 

15 " 429 



<21D> 28 
<211> 404 
<212> DMA 
20 c2i3> Brassiea napua 



<220> 

«=22l> misc__f eature 

<222> (X) . . (404) 



c400> 28 

«taagcttc tactcgatgc tcgtaatttg aggtggacgc tttgtgctrac 60 
agatagatgc ttcaaaaacc ecgtaacttc agctttcttc cgatcegtea aggtttfcgcc 120 

r!!^^ tfcc * tca * ca gggaatggac atcgcglttc SIScSaa III 
taatggagga tgggttcaca tatfctccaga aggtagtcgc tcccgatatg gtggcaagac 240 
SSSS?* f° m ^S g grgagtcata tecgLcttl StSSSS too 

cfcttatgtaa tgegtgtgta tggaccttat tataacaeaa acaagctcgt gatteafttc III 
tttgtgcaag atgatttctc nctcagatae catgcgtatg aatg^ * atteact " 



<210> 29 
<211> 467 
*212> DNX 

<2X3> Brassiea napus 
<220> 

<221> m±sc_f eature 
<222> (1). 7(467) 

45 <400> 29 

gaattcfccgg gtcgacgata gtgcaaattt agatgatcca gtaatgtggg gagggtccaa 60 
ggtcttcttt ccttagatce agagctggct cgatgggtac ttgctgcaga ggacatttgt 120 
ttcaagaacc ctgccttcte ecacatcttc cgcaetggca agtgtataee tataactaga iao 
HSTtggtggaa tctaccaaga acacatgagt gaagctcccg agcgattaaa agatggatct 240 
tggttgcata cctteccaca gggcacggtg ttacacgatg atgtgcefcag ctgacgaett 300 
acatggggaa ccggcggcct aatcccgcgt tgaccaaeca cgecaarfccx cttgccaata 360 
tttcacggeg actgfcgacga cateatgacg eagaaggcca tggatctata aacaeeaeicg 420 
ctacctccct tgateaaaac cgcaaacgta gaggaggeta accctcn 457 



<2X0» 30 
*211> 4S9 
<212> 

<213> Brasaica nap-as 



29-^1 S002/Hl/9i:ire2^dug 
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<220> 

<22l> miae_ difference 
<222> (1).-(4S9) 

5 

<AQO> 30 

gatcaccgtg gtagagccgc ggfctttafcat aeggcacggt atagegttct cttctccatc 60 
caccgettgg etgctcgatt ccggaacttc egccgcgag* ctetccetfcc tgcccecgec 120 
ttfctatcgca gaagagtacc taaagattfcg aeggcagaag aagagtcfcgc tatcfcfcccgg lflO . 
10 atgcttcaag Ctgtggecgfc tceacttafct ggaaaegctc gtcafcgttfct catgaatggt 240 
cttaaccgtg tteaggtgta fcggtttagag aagecgcatg atgetcfcgct caacaggcca 3 00 
aagaacaage ccctcgtaac ggttagcaat catgtggcat ccttggatga tccatttgtc 360 
attgcfctcgt taettccgcc fcaagcfctcta etcgatgctc ggaatctgag gtggacgett 420 
tgggetacac acagatggtt: taccaaccct gtgcttceg ~ 450 

1 5 

<210> 31 
<211> 389 
<212> DNA 
20 <213> Glycine max 

<220> 

c221> mis cofeature 
<222> (1) .7{3B3) 

25 

<400> 31 

ggggtactgc geecgcaatc eccggnccgg accaceattg gcgcegcgac accaccacac 60 
ctttatcccc tetgccgatg gctacttctc ctcgaceatt caacgctgcc tcagccgatt 120 
tcgtgatttc cgaaggtatt cattgccttc ttccacctct ctccatcgta aacgagtg-at ISO 
30 taaggatxte agttctgagg aagattcagc tcttgfctcgg aegatgcaag ctgttgcggt 240 
teetgttctt ggaaatgfcet gtcacgtgtt tacgaacgga fctaaaccagg tgcaggtgta 300 
fcggtfctagaa aaactgcact ccgcgttgct gcatagacct aagggeaaac ctcttcttac 360 
ggtcageaat cangttgctt ccatggatg 389 

35 

<210> 32 
<211> 400 

<2i2> m 

<213> Orysa sativa 
<220> 

<221> miac_f eatuare 
<222> (1) .7(400) 

45 <400> 32 

agaaaaacfcg cactecgetc tactgcaaag acctaaagga aaacctcttc ttacggtcag 60 
caaccatgtt gctxccatgg atgatcctct tgttattget tegctgcttc cteegagrgt 120 
tcttttggac gctaggaatc tcagatggac gctttgcgca actgataggt gttttaaaaa 180 
ccctgtgacc tctgcattct ttcgatcagt caaagttttg ccagtttctc gaggtgatgg 240 
50 catctatcaa gaaggaatgg acttggccat atcaaaattg aaccatggtg gttgggtcca 300 
gatattccca cacggeggtt gatccetcta cttttcaaaa tcagaaagtt aaaataaggg 360 
agggggcghc gaaaaatcca agcggggage gggceccttg 40 o 

55 <i210> 33 
<211> 443 
<212> DNA 

<213> Brassica napus 
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<220> 

<22l> mi a c_£ eature 
<222> (1) > . (445) 

e400> 33 

aatteccggg ccgacgacca ccgtggcaga gccgcggttt tatatacgga cggttacttc 60 
tcctccteca tccaccgctt ggctgetega ttceggaact tccgccgcga gtctctccct 120 
tctgcccecg ctttttatcg cagaagagta cctaaagatt tgaeggcaga agaagagtct 180 
gctatettce ggatgettea agctgtgget gttccactta ttggaaacge ttgccatgtt 240 
ttcatgaatg gtcttaaccg tgttcaggtg tatggtttgg agaagttgea tgatgettta 300 
ctcaacagac caaagaacaa gectcttgta aeggttagea ateatgtggc gtccttggat 360 
gatccatttg teattgette gttacttcct cctaagctte tacttgatgc tegtaatctcr 420 
aggtggacgc tntgtgctac agatagatg 44B 

<210> 34 
<211* 429 
<212> 

<2i3=» Oryza sativa 
<r220> 

<22l> mis cofeature 
<222* (1) . . (429) 

25 *400> 34 

ccgggatggc ggaaaaaeca tgggctctte caagagaggt gttgggaggt tagtcctgga £0 
tggagatagc atgectgttg ttgrcccatt tgtacataca gggatgeagg agattatgee 120 
tgtaggtgct aactttecca gaataggcaa gatggttaca gtgetcatag gtgatecgat lao 
eaattttgat gatataattg aatttgaeaa agaeanaggc tcaaatgtgc eeagaagaeg 240 
actatatgat gcagtagcat ctaaaattgg tgatcggttg cttgagatga aggtccaggt 3 00 
tgaeactafcc gcaattgtca agaaatgeag gtaccagaaa agtcetcaca eagaetgace 360 
gaceartaaa aaactgagce aggtgattgg gactaatttg aatggaeatc ttctggccgc 420 
agaaatgee ^ a 

35 

<210> 35 
<211^ 449 
<2122> DNA. 

<2i3> Brassica uapus 

40 

<220> 

<221> miac feature 
<222> (1) .7(449) 

45 <400* 35 

aattcccggg tcgacgatca ccgtggcaga geegeggttt tatatacgga eggttactcc 60 
tcctcctcca tccaccgctt ggctgetega ttceggaact tccgccgcga gtctctccct 120 
tctgcccccg ctttttaceg cagaagagta cctaaagatt tgaeggcaga agaagagtct lao 
gctatettce ggatgettea agctgtgget gctccactta ttggaaacgc ttgtcatgtt 24 o 
50 ttcatgaatg gtcttaaccg tgttcaggtg tatggtttgg agaagttgea tgatgettta 300 
ctcaacagac caaagaacaa gcctcttgta aeggttagea atcatgtggc gnccttggat 360 
gatccatttg teattgette gttacttcct cetaagcctc tacttgatgc ccgtaatctg 420 
aggtggacgc tntgtgctac agatagatg 449 

55 
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